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Abstract 

In this paper we introduce a general framework for casting fully dy- 
namic transitive closure into the problem of reevaluating polynomials over 
matrices. With this technique, we improve the best known bounds for 
fully dynamic transitive closure. In particular, we devise a deterministic 
algorithm for general directed graphs that achieves 0(n 2 ) amortized time 
for updates, while preserving unit worst-case cost for queries. In case of 
deletions only, our algorithm performs updates faster in 0(n) amortized 
time. 

Our matrix-based approach yields an algorithm for directed acyclic 
graphs that breaks through the 0(n 2 ) barrier on the single-operation com- 
plexity of fully dynamic transitive closure. We can answer queries in 0(n e ) 
time and perform updates in 0(n a, ( 1 ' e,1 )~ e + n 1+e ) time, for any e € [0, 1], 
where uj(1, e, 1) is the exponent of the multiplication of an nxn e matrix by 
an n e x n matrix. The current best bounds on ui(l, e, 1) imply an 0(n 0,58 ) 
query time and an 0(n 1,58 ) update time. Our subquadratic algorithm is 
randomized, and has one-side error. 
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1 Introduction 



In this paper we present fully dynamic algorithms for maintaining the transitive 
closure of a directed graph. A dynamic graph algorithm maintains a given prop- 
erty on a graph subject to dynamic changes, such as edge insertions and edge 
deletions. We say that an algorithm is fully dynamic if it can handle both edge 
insertions and edge deletions. A partially dynamic algorithm can handle either 
edge insertions or edge deletions, but not both: we say that it is incremental if 
it supports insertions only, and decremental if it supports deletions only. In the 
fully dynamic transitive closure problem we wish to maintain a directed graph 
G = (V, E) under an intermixed sequence of the following operations: 

Insert(x, y): insert an edge from x to y in G; 
Delete(x, y): delete the edge from x to y in G; 

Query(x, y): report yes if there is a path from x to y in G, and no otherwise. 

Throughout the paper, we denote by m and by n the number of edges and vertices 
in G, respectively. 

Research on dynamic transitive closure spans over two decades. Before de- 
scribing the results known, we list the bounds obtainable with simple-minded 
methods. If we do nothing during each update, then we have to explore the 
whole graph in order to answer reachability queries: this gives 0(n 2 ) time per 
query and 0(1) time per update in the worst case. On the other extreme, we 
could recompute the transitive closure from scratch after each update; as this 
task can be accomplished via matrix multiplication |T|, this approach yields 



0(1) time per query and 0(n w ) time per update in the worst case, where to is 
the best known exponent for matrix multiplication (currently uj < 2.38 0). 

Previous Work. For the incremental version of the problem, the first algo- 
rithm was proposed by Ibaraki and Katoh fl]J in 1983: its running time was 



0(n ) over any sequence of insertions. This bound was later improved to 0(n) 
amortized time per insertion by Italiano [jl!| and also by La Poutre and van 
Leeuwen [17]. Yellin [|TjJ gave an 0(m*5 max ) algorithm for m edge insertions, 
where m* is the number of edges in the final transitive closure and 5 max is the 
maximum out-degree of the final graph. All these algorithms maintain explicitly 
the transitive closure, and so their query time is 0(1). 

The first decremental algorithm was again given by Ibaraki and Katoh [fPH , 
with a running time of 0(n 2 ) per deletion. This was improved to 0(m) per 



deletion by La Poutre and van Leeuwen |17fl . Italiano |I3[ presented an algo- 
rithm that achieves 0(n) amortized time per deletion on directed acyclic graphs. 
Yellin |19|] gave an 0(m*5 max ) algorithm for m edge deletions, where m* is the 
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initial number of edges in the transitive closure and 5 max is the maximum out- 
degree of the initial graph. Again, the query time of all these algorithms is 0(1). 
More recently, Henzinger and King || gave a randomized decremental transitive 
closure algorithm for general directed graphs with a query time of 0(n/ log n) 
and an amortized update time of 0(n log 2 n). 

The first fully dynamic transitive closure algorithm was devised by Henzinger 
and King || in 1995: they gave a randomized Monte Carlo algorithm with one- 
side error supporting a query time of Oinj logra) and an amortized update time of 
0(nm 0,58 log n), where m is the average number of edges in the graph throughout 
the whole update sequence. Since m can be as high as 0(n 2 ), their update 
time is 0(n 2 ' 16 log 2 n). Khanna, Motwani and Wilson |14| proved that, when a 



lookahead of 6(n 018 ) in the updates is permitted, a deterministic update bound 
of 0(n 218 ) can be achieved. Very recently, King and Sagert [16| showed how to 



support queries in 0(1) time and updates in 0(n 2 ' 26 ) time for general directed 
graphs and 0(n 2 ) time for directed acyclic graphs; their algorithm is randomized 
with one-side error. The bounds of King and Sagert were further improved by 
King [HJ, who exhibited a deterministic algorithm on general digraphs with 0(1) 
query time and 0(n 2 logn) amortized time per update operations, where updates 
are insertions of a set of edges incident to the same vertex and deletions of an 
arbitrary subset of edges. We remark that all these algorithms (except |L5]]) use 
fast matrix multiplication subroutine. 

We observe that fully dynamic transitive closure algorithms with 0(1) query 
time maintain explicitly the transitive closure of the input graph, in order to 
answer each query with exactly one lookup (on its adjacency matrix). Since an 
update may change as many as Q(n 2 ) entries of this matrix, 0(n 2 ) seems to be 
the best update bound that one could hope for this class of algorithms. It is thus 
quite natural to ask whether the 0(n 2 ) update bound can be actually realized 
for fully dynamic transitive closure on general directed graphs while maintaining 
one lookup per query. Another important question, if one is willing to spend 
more time for queries, is whether the 0(n 2 ) barrier for the single-operation time 
complexity of fully dynamic transitive closure can be broken. We remark that 
this has been an elusive goal for many years. 

Our Results. In this paper, we affirmatively answer both questions. We first 
exhibit a deterministic algorithm for fully dynamic transitive closure on general 
digraphs that does exactly one matrix look-up per query and supports updates in 
0(n 2 ) amortized time, thus improving over Our algorithm can also support 
within the same time bounds the generalized updates of JOJ, i.e., insertion of a set 
of edges incident to the same vertex and deletion of an arbitrary subset of edges. 
In the special case of deletions only, our algorithm achieves 0(n) amortized time 
for deletions and 0(1) time for queries: this generalizes to directed graphs the 
bounds of [0, and improves over f{|. 



As our second contribution, we present the first algorithm that breaks through 
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the 0(n 2 ) barrier on the single-operation time complexity of fully dynamic tran- 
sitive closure. In particular, we show how to trade off query times for updates 
on directed acyclic graphs: each query can be answered in time 0(n e ) and each 
update can be performed in time 0(n a '' 1 ' e ' 1 ' e + n 1+e ), for any e G [0, 1], where 
e, 1) is the exponent of the multiplication of an n x n e matrix by aiin'xn 
matrix. Balancing the two terms in the update bound yields that e must satisfy 
the equation u(l,e,l) = 1 + 2e. The current best bounds on uj(1, e, 1) || |10| 
imply that e < 0.58 |2(|. Thus, the smallest update time is 0(n L58 ), which gives 
a query time of 0(n 0,58 ). Our subquadratic algorithm is randomized, and has 
one-side error. 

All our algorithms are based on a novel technique: we introduce a general 
framework for maintaining polynomials defined over matrices, and we cast fully 
dynamic transitive closure into this framework. In particular, our deterministic 
algorithm hinges upon the equivalence between transitive closure and matrix 
multiplication on a closed semiring; this relation has been known for over 30 
years (see e.g., the results of Munro |T3], Furman M and Fischer and Meyer J/J) 



and yields the fastest known static algorithm for transitive closure. Surprisingly, 
no one before seems to have exploited this equivalence in the dynamic setting: 
some recent algorithms ||, [14], [16] make use of fast matrix multiplication, but only 
as a subroutine for fast updates. Differently from other approaches, the crux of 
our method is to use dynamic reevaluation of products of Boolean matrices as 
the kernel for solving dynamic transitive closure. 

The remainder of this paper is organized as follows. We first formally define 
the fully dynamic transitive closure problem and we give preliminary definitions 
in Section A high-level overview of our approach is given in Section In 
Section [| we introduce two problems on dynamic matrices, and show how to 
solve them efficiently Next, we show how to exploit these problems on dynamic 
matrices for the design of three efficient fully dynamic algorithms for transitive 
closure in Section ||, Section |6| and Section [7| respectively. Finally, in Section || 
we list some concluding remarks. 



2 Fully Dynamic Transitive Closure 

In this section we give a more formal definition of the fully dynamic transitive 
closure problem considered in this paper. We assume the reader to be familiar 
with standard graph and algebraic terminology as contained for instance in ■ 

Definition 1 Let G = (V, E) be a directed graph and let TC(G) = (V, E') be 
its transitive closure. The Fully Dynamic Transitive Closure Problem 
consists of maintaining a data structure G for graph G under an intermixed se- 
quence a = (G.Opi, . . . ,G.0p fc ) of Initialization, Update, and Query opera- 
tions. Each operation G.Op - on data structure G can be either one of the following: 
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G . Insert({(u 1 ,v),(u 2 ,v),(v,u 3 ),(v,u 4 ),(v,u 5 )},v) 






G . Delete({(u 1 ,u 2 ),(u2,u 3 ),(u4,U5),(u 6 ,u 7 ),(u 8 ,u 9 )}) 

o 




(b) 



Figure 1: (a) Insert operation; (b) Delete operation as in Definition [l]. 

• G. Init(A): perform the initialization operation E <— A, where A C V x V. 

• G. Insert (y, I): perform the update E <— E U {(u, v ) | u G V A (w, v ) G 
/} U I m G V A (v, u) G J} ; where ICE and v G V. We call this 
update a v -Centered insertion in G. 

• G.Delete(D): perform the update E <— E — D , where D C E. 

• G. Query (x,y): perform a query operation on TC(G) by returning 1 if 
(x, y) G E' and otherwise. 

Few remarks are in order at this point. First, the generalized Insert and 
Delete updates considered here have been first introduced by King in |TI|. With 
just one operation, they are able to change the graph by adding or removing a 
whole set of edges, rather than a single edge, as illustrated in Figure Second, 
we consider explicitly initializations of the graph G and, more generally than 
in the traditional definitions of dynamic problems, we allow them to appear 
everywhere in sequence a. This gives more generality to the problem, and allows 
for more powerful data structures, i.e., data structures that can be restarted at 
run time on a completely different input graph. Differently from others variants 
of the problem, we do not address the issue of returning actual paths between 
nodes, and we just consider the problem of answering reachability queries. 

< o > 

It is well known that, if G = (V, E) is a directed graph and Xq is its adjacency 
matrix, computing the Kleene closure Xq of Xq is equivalent to computing the 
(reflexive) transitive closure TC(G) of G. For this reason, in this paper, instead 
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of considering directly the problem introduced in Definition [II we study an equiv- 
alent problem on matrices. Before defining it formally, we need some preliminary 
notation. 

Definition 2 If X is a matrix, we denote by Ix.i and Jx.j the matrices equal to 
X in the i-th row and j-th column, respectively, and null in any other entries: 



Definition 3 Let X and Y be nx n Boolean matrices. Then X C7 if and only 
ifX[x,y} = l Y[x,y} = I for any x,y <E {l,...,n}. 

We are now ready to define a dynamic version of the problem of computing 
the Kleene closure of a Boolean matrix. In what follows, we assume that algebraic 
operations + and — are performed modulo n + 1 by looking at Boolean values 
and 1 as integer numbers. Integer results are binarized by converting back 
nonzero values into 1 and zero values into 0. We remark that in our dynamic 
setting operator — is just required to flip matrix entries from 1 to 0. 

Definition 4 Let X be an n x n Boolean matrix and let X* be its Kleene closure. 
We define the Fully Dynamic Boolean Matrix Closure Problem as the 
problem of maintaining a data structure X for matrix X under an intermixed 
sequence a = (X.0p 1; . . . , X.0p fc ) of initialization, update, and query operations. 
Each operation X.Op^- on data structure X can be either one of the following: 

• X. Init*(y): perform the initialization operation X <— Y, where Y is an 
n x n Boolean matrix. 

• X.Set*(i, AX): perform the update X <— X + Iax,% + Jax,i, where AX is 
an nx n Boolean matrix and i e {1, . . . , n}. We call this kind of update an 
i-CENTERED set operation on X and we call AX Update Matrix. 

• X. Reset* (AX): perform the update X <— X — AX, where AX C X is an 
n x n Boolean update matrix. 

• X. Lookup* (x, y): return the value of X*[x,y], where x, y G {1, . . . , n}. 

Notice that Set* is allowed to modify only the i-th row and the i-th column 
of X, while Reset* and Init* can modify any entries of X. We stress the strong 
correlation between Definition ^ and Definition [I]: if G is a graph and X is 
its adjacency matrix, operations X.Init*, X.Set*, X. Reset*, and X. Lookup* are 
equivalent to operations G . Init, G . Insert, G . Delete, and G . Query, respectively. 
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3 Overview of Our Approach 



In this section we give an overview of the new ideas presented in this paper, 
discussing the most significant aspects of our techniques. 

Our approach consists of reducing fully dynamic transitive closure to the 
problem of maintaining efficiently polynomials over matrices subject to updates 
of their variables. In particular, we focus on the equivalent problem of fully 
dynamic Kleene closure and we show that efficient data structures for it can be 
realized using efficient data structures for maintaining polynomials over matrices. 

Suppose that we have a polynomial over Boolean matrices, e.g., P(X, Y, Z, W) = 
X + YZ 2 W, where matrices X, Y, Z and W are its variables. The value 
P(X, Y, Z, W) of the polynomial can be computed via sum and multiplication 
of matrices X, Y, Z and W in 0(n 2,38 ). Now, what kind of modifications can we 
perform on a variable, e.g., variable Z, so as to have the chance of updating the 
value of P(X, Y, Z, W) in less than 0(n 2 38 ) time? 

In Section [4.1| we show a data structure that allows us to reevaluate correctly 
P(X,Y, Z,W) in just 0(n 2 ) amortized time after flipping to 1 any entries of 
Z that were 0, provided they lie on a row or on a column (SetRow or SetCol 
operation), of after flipping to any entries of Z that were 1 (Reset operation). 
This seems a step forward, but are this kind of updates of variables powerful 
enough to be useful our original problem of fully dynamic transitive closure? 
Unfortunately, the answer is no. Actually, we also require the more general Set 
operation of flipping to 1 any entries of Z that were 0. Now, if we want to 
have our polynomial always up to date after each variable change of this kind, 
it seems that there is no way of doing any better than recomputing everything 
from scratch. 

So let us lower our expectations on our data structure for maintaining P, 
and tolerate errors. In exchange, our data structure must support efficiently 
the general Set operation. The term "errors" here means that we maintain a 
"relaxed" version of the correct value of the polynomial, where some 0's may 
be incorrect. The only important property that we require is that any l's that 
appear in the correct value of the polynomial after performing a SetRow or SetCol 
operation must also appear in the relaxed value that we maintain. This allows us 
to support any Set operation efficiently in a lazy fashion (so in the following we 
call it LazySet) and is powerful enough for our original problem of fully dynamic 
transitive closure. 

Actually, doing things lazily while maintaining the desired properties in our 
data structure for polynomials is the major technical difficulty in Section [O . 
Sections [5| and |6] then show two methods to solve the fully dynamic Boolean 
matrix closure problem by using polynomials of Boolean matrices as if they were 
building blocks. The second method yields the fastest known algorithm for fully 
dynamic transitive closure with constant query time. If we give up maintaining 
polynomials of degree > 1, using a surprisingly simple lazy technique we can 
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even support certain kinds of variable updates in subquadratic worst-case time 
per operation (see Section fOj ). This turns out to be once again applicable to 
fully dynamic transitive closure, yielding the first subquadratic algorithms known 
so far for the problem (see Section ^). 



4 Dynamic Matrices 

In this section we consider two problems on dynamic matrices and we devise fast 
algorithms for solving them. As we already stated, these problems will be central 
to designing efficient algorithms for the fully dynamic Boolean matrix closure 
problem introduced in Definition |j. In more detail, in Section |4.1| we address the 



problem of reevaluating polynomials over Boolean matrices under modifications 
of their variables. We propose a data structure for maintaining efficiently the 
special class of polynomials of degree 2 consisting of single products of Boolean 
matrices. We show then how to use this data structure for solving the more 



general problem on arbitrary polynomials. In Section [4.2| we study the problem 
of finding an implicit representation for integer matrices that makes it possible 
to update as many as Q(n 2 ) entries per operation in o(n 2 ) worst-case time at the 
price of increasing the lookup time required to read a single entry. 



4.1 Dynamic Reevaluation of Polynomials over Boolean 
Matrices 

We now study the problem of maintaining the value of polynomials over Boolean 
matrices under updates of their variables. We define these updates so that they 
can be useful later on for our original problem of dynamic Boolean matrix closure. 
We first need some preliminary definitions. 

Definition 5 Let X be a data structure. We denote by X{ the value of X at 
Time i, i.e., the value of X after the i-th operation in a sequence of operations 
that modify X . By convention, we assume that at time any numerical value in 
X is zero. In particular, if X is a Boolean matrix, X Q = n . 

In the following definition we formally introduce our first problem on dynamic 
matrices. 



Definition 6 Let B n be the set of n x n Boolean matrices and let 

h 
o=l 
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be a polynomial!^ with h terms defined over B n , where each 

6=1 

has degree exactly k and variables X£ e B n are distinct. We consider the problem 
of maintaining a data structure P for the polynomial P under an intermixed 
sequence a = (P.Op 1; . . . , P.Op ; ) of initialization, update, and query operations. 
Each operation P.Opj on the data structure P can be either one of the following: 

• P . Init(Z|, . . . , Z^) : perform the initialization X£ <— Z% of the variables of 
polynomial P, where each Z£ is an n x n Boolean matrix. 

• P . SetRow(i, AX, X£) : perform the row update operation X% <— X% + lAx,i, 
where AX is an n x n Boolean update matrix. The operation sets to 1 the 
entries in the i-th row of variable X% of polynomial P as specified by matrix 
AX. 

• P . SetCol(i, AX, X%): perform the column update operation X^ 1 <— X§ + 
J/±x,i, where AX is an n x n Boolean update matrix. The operation sets to 
1 the entries in the i-th column of variable X^ of polynomial P as specified 
by matrix AX. 

• P . LazySet(AX, X£): perform the update operation X^ <— X£ + AX, where 
AX is an n x n Boolean update matrix. The operation sets to 1 the entries 
of variable X§ of polynomial P as specified by matrix AX. 

• P .Reset(AX, X£): perform the update operation X£ <— X£ — AX, where 
AX is an n x n Boolean update matrix such that AX C X^. The operation 
resets to the entries of variable X^ of polynomial P as specified by matrix 
AX. 

• P.Lookup(): answer a query about the value of P by returning an n x n 
Boolean matrix Yj, such that Mj CFjC Pj, where M is an n x n Boolean 
matrix whose value at time j is defined as follows: 

M 3 = E ( p i - p i-0 

1 < i < j ■ 
0p i 7^ LazySet 

and Pi is the value of polynomial P at time i. According to this definition, 
we allow the answer about the value of P to be affected by one-side error. 

^dn the following, we omit specifying explicitly the dependence of a polynomial on its vari- 
ables, and we denote by P both the function P(Xi, . . . , X^) and the value of this function for 
fixed values of X\, . . . , X^, assuming that the correct interpretation is clear from the context. 
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SetRow and SetCol are allowed to modify only the z-th row and the z-th 
column of variable Xg, respectively, while LazySet, Reset and Init can modify 
any entries of Xg. It is crucial to observe that in the operational definition 
of Lookup we allow one-side errors in answering queries on the value of P. In 
particular, in the answer there have to be no incorrect l's and the error must be 
bounded: Lookup has to return a matrix Y that contains at least the l's in M, 
and no more than the l's in P. As we will see later on, this operational definition 
simplifies the task of designing efficient implementations of the operations and is 
still powerful enough to be useful for our original problem of dynamic Boolean 
matrix closure. 

The following lemma shows that the presence of errors is related to the pres- 
ence of LazySet operations in sequence a. In particular, it shows that, if no 
LazySet operation is performed, then Lookup makes no errors and returns the 
correct value of polynomial P. 

Lemma 1 Let P be a polynomial and let a = (P.0p 1; . . . , P.0p fc ) be a sequence of 
operations on P. If Op^ 7^ LazySet for all 1 < z < j < k, then Mj = Pj. 

Proof. The proof easily follows by telescoping the sum that defines Mf Mj = 

P 3 - Pj_i + Pj. x - Pj- 2 + --- + P 2 -P 1 +P 1 -P = P,-P = Pj. ' □ 

Errors in the answers given by Lookup may appear as soon as LazySet opera- 
tions are performed in sequence a. To explain how M is defined mathematically, 
notice that Mq = n by Definition |5] and M sums up all the changes that the 
value of P has undergone up to the j-th operation, except for the changes due 
to LazySet operations, which are ignored. This means that, if any entry P[x,y] 
flips from to 1 or vice- versa due to an operation Opj different from LazySet, so 
does M[x, y] and thus Y[x, y]. 

As a side note, we remark that it is straightforward to extend the results of 
this section to the general class of polynomials with terms of different degrees 
and multiple occurrences of the same variable. 

< o > 

We now focus on the problem of implementing the operations introduced in 
Definition ^. A simple-minded implementation of the operations on P is the 
following: 

• Maintain variables Xg, terms T a , and a matrix Y that contains the value 
of the polynomial. 

• Recompute from scratch T a and the value of Y = P = T\ + ■ ■ ■ + Th after 
each Init, SetRow, SetCol and Reset that change Xg. 

• Do nothing after a LazySet operation, except for updating X£. This means 
that Y may be no longer equal to P after the operation. 
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• Let Lookup return the maintained value of Y . 

It is easy to verify that at any time j, i.e., after the j-th operation, Opj ^LazySet 
implies Y = P and Opj =LazySet implies Y = M. In other words, the value Y 
returned by Lookup oscillates between the exact value P of the polynomial and 
the value M obtained without considering LazySet operations. 

With the simple-minded implementation above, we can support Init in 0(h- 
k-n^ + h-n 2 ) time, SetRow and SetCol in 0(k-n u) ) time, Reset in 0(k-n u) ' + h-n 2 ) 
time, and Lookup and LazySet in 0(n 2 ) time. 

The remainder of this section provides more efficient solutions for the problem. 
In particular, we present a data structure that supports Lookup and LazySet 
operations in 0(n 2 ) worst-case time, SetRow, SetCol and Reset operations in 
0(k ■ n 2 ) amortized time, and Init operations in 0(h ■ k ■ + h ■ n 2 ) worst-case 
time. The space used is 0(h ■ k 2 • n 2 ). Before considering the general case where 
polynomials have arbitrary degree k, we focus on the special class of polynomials 
where k = 2. 

4.1.1 Data Structure for Polynomials of Degree k — 2 

We define a data structure for P that allows us to maintain explicitly the value Yj 
of the matrix Y at any time j during a sequence (P.0p 1 , . . . , P-Op^) of operations. 
This makes it possible to perform Lookup operations in optimal quadratic time. 
We avoid recomputing from scratch the value of Y after each update as in the 
simple-minded method, and we propose efficient techniques for propagating to Y 
the effects of changes of variables X% due to SetRow, SetCol and Reset opera- 
tions. In case of LazySet, we only need to update the affected variables, leaving 
the other elements in the data structure unaffected. This, of course, implies that 
after a LazySet at time j, the maintained value Yj will be clearly not synchro- 
nized with the correct value Pj of the polynomial. Most technical difficulties of 
this section come just from this lazy maintenance of Yj. 

Our data structure for representing a polynomial of degree 2 of the form 
P = X\-Xl + ...+X^- X% is presented below. 

Data Structure 1 We maintain the following elementary data structures with 
0(h ■ n 2 ) space: 

1. 2h matrices X® and X% for 1 < a < h; 

2. h integer matrices Prodi, Prodh such that Prod a maintains a "lazy" 
count of the number of witnesses of the product T a = X® ■ X%. 

3. an integer matrix S such that S[x,y] = \{a : Prod a [x,y] > 0}|. We assume 
that Yj[x,y] = 1 S[x,y] > 0. 
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4- 2h integer matrices LastFlipx, one for each matrix X = X%. For any 
entry X[x,y] = 1, LastFlip x [x,y] is the time of the most recent operation 
that caused X[x,y] to flip from to 1. More formally: 

LastFlipx Ax, y] = max{t | X t [x,y] - X t -i[x,y] = 1} 

if Xj[x,y] = 1, and is undefined otherwise; 

5. 2h integer vectors LastRowx , one for each matrix X = X%. LastRowx[i] 
is the time of the last Init or SetRow operation on the i-th row of X , and 
zero if no such operation was ever performed. More formally: 

Last Row x [i] = max{0, t | 0p t = Init(. . .) V 0p t = SetRow(i, AX, X)} 

We also maintain similar vectors LastColx; 

6. a counter Time of the number of performed operations; 

Before getting into the full details of our implementation of operations, we 
give an overview of the main ideas. We consider how the various operations 
should affect the data structure. In particular, we suppose that an operation 
changes any entries of variable Xf in a term T a = X" • Xf , and we define what 
our implementation should do on matrix Prod a : 

SetRow/SetCol: if some entry Xf[x, y] is flipping to 1, then y becomes a witness 
in the product Xf ■ X 1 ^ for any pair x, z such that X%[y, z] — 1. Then we 
should put y in the count Prod Ax, z], if it is not already counted. Moreover, 
if some entry Xf[x, y] was already 1, but for some pair x,z the index y is 
not counted in Prod Ax, z], then we should put y in the count Prod a [x, z\. 

LazySet: if some entry Xf[x, y] is flipping to 1, then y becomes a witness for 
any pair x,z such that X^\y,z\ = 1. Then we should put y in the count 
Prod Ax, z], if it is not already counted, but we do not do this. 

Reset: if some entry X±[x, y] is flipping to 0, then y is no longer a witness for all 
pairs x, z such that X|[y, z] — 1. Then we should remove y from the count 
Prod Ax, z], if it is currently counted. 

Note that after performing LazySet there may be triples (x, y, z) such that 
both X"[x,y] = 1 and X^\y,z\ = 1, but y is not counted in Prod a [x,z\. Now 
the problem is: is there any property that we can exploit to tell if a given y is 
counted or not in Prod Ax, z] whenever both Xf [x, y] — 1 and X%[y, z] = 1? 

We introduce a predicate V a {x,y,z), 1 < x,y, z < n, such that V a (x,y,z) 
is true if and only if the last time any of the two entries X±[x, y] and X^y^z] 
flipped from to 1 is before the time of the last update operation on the x-th 
row or the y-th column of X" and the time of the last update operation on the 
2/-th row or the 2-th column of Xf . In short: 
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V a (x,y,z) := meot{LastFlipx*[x,y], LastFlipx a [y, z]} < 
m&x{LastRowxa [x], LastColx* [y], LastRowx%[y], LastColx%[z]} 

The property V a answers our previous question and allows it to define the 
following invariant that we maintain in our data structure. We remark that we 
do not need to maintain V a explicitly in our data structure as it can be computed 
on demand in constant time by accessing Last Flip and LastRow. 

Invariant 1 For any term T a = X" ■ X% in polynomial P, at any time during 
a sequence of operations a, the following invariant holds for any pair of indices 
x, z: 

Prod a [x,z] = \{y : X«{x,y] = l A X 2 %, z] = 1 A V a (x,y,z)}\ 

According to Invariant [1], if is clear that the value of each entry Prod a [x, z] 
is a "lazy" count of the number of witnesses of the Boolean matrix product 
T a [x,z] = {XI ■ X%)[x,z}. Notice that, since T a [x,z] = 1 ^ 3y : Xf[x,y] = 
1 A Xffy, z] = 1, we have that Prod a [x,z] > =>- T a [x,z] = 1. Thus, we may 
think of V a "relaxation" property. 

We implement the operations introduced in Definition || as described next, 
assuming that the operation Time <— Time + 1 is performed just before each 
operation: 

Init 

procedure Init(Z 1 1 , Z\,..., Z^, Z%) 

1. begin 

2. for each a do Xf «- Zf; X% *- Z% 

3. { initialize members 2-5 of Data Structure [l| } 

4. end 

Init assigns the value of variables and X^ and initializes elements 2-5 of Data 
Structure [I]. In particular, Last F 'lip x[x,y] is set to Time for any X[x, y] = 1 and 
the same is done for LastRow[i] and LastCol[i] for any i. Prod a is initialized by 
computing the product • Xf in the ring of integers, i.e., looking at X^ as integer 
matrices. 

Lookup 

function LookupQ 

1. begin 

2. return Y s.t. Y[x,y] = 1 S[x,y] > 

3. end 

Lookup simply returns a binarized version Y of matrix S defined in Data Structure |1|. 
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SetRow 



procedure SetRow(i, AX, X%) 

I. begin 

2- X% <— Xg + 

3. {update LastFlipx*} 

4. if 6 = 1 then 

5. for each x : Xf[i,x] = 1 do 

6. for each y : Xf[x, y] = 1 do 

7. if not V a (i,x,y) then 

8. Prod a [i, y] <- Prod a [i, y] + l 

9. if Prod a [i, y] = 1 then y] «- y] + 1 

10. else{6 = 2: similar to P . SetCol(i, AX, Xf )} 

II. LastRowxg [i] <— Time 
12. end 



After performing an i-centered insertion in on line 2 and after updating 
LastFlipxg on line 3, SetRow checks on lines 5-7 for any triple (i,x,y) such 
that the property V a (i,x,y) is still not satisfied, but will be satisfied thanks to 
line 11, and increases Prod a and S accordingly (lines 8-9). 

SetCol 



procedure SetCol(i, AX, Xg) 

1. begin 

2. XI <— X^ 1 + Jax,« 

3. {update LastFlipx a } 

4. if 6 = 1 then 

5. for each x : Xf[x,i] = 1 do 

6. for each y : Xff^y] = 1 do 

7. if not V a (x,i,y) then 

8. Prod a [x, y] <- Prod a [x, y) + 1 

9. if Prod a [x, y] = 1 then S[x,y] <— £[a;,y] + 1 

10. else {6 = 2: similar to P . SetRow(i, AX, Xf )} 

11. LastColx*[i] ^ Time 

12. end 



Similar to SetRow. 

LazySet 

procedure LazySet (AX, X%) 

1. begin 

2. X 6 a <- X fe a + AX 

3. {update LastFlipxg} 

4. end 

LazySet simply sets to 1 any entries in and updates LastFlipxg- We remark 
that no other object in the data structure is changed. 



14 



Reset 



procedure Reset(AX, Xg) 

1. begin 

2. if 6 = 1 then 

3. for each x,y : AX[x,y] = 1 do 

4. if m&x{LastRowx a [x], LastColx a [y]} > LastFlipx a [x,y] then 

5. for each z : Xf [y, z] = 1 do 

6. if "P a (a;, y, z) then 

7. Prod a [x,z] <— Prod a [x, z] — 1 

8. if Prod a [x, z] = then z] <— z] — 1 

9. else { here max{LastRowx^[x], LastColxf[y]} < LastFlipx^[x,y] } 

10. for each z : Xf[y, z) = 1 A LastColx%[z] > LastFlipx*[x,y] do 

11. if V a (x, y, z) then 

12. Prod a [x, z] <— Prod a [x, z] — 1 

13. if Prod a [x, z] = then S[x, z] <— S[x, z] — 1 

14. else {6 = 2 similar to 6 = 1} 

15. X 6 a <- XI - AX 

16. end 



In lines 2-14, using LastRowxg, LastColxg, and LastFlipxg, Reset updates 

Prod a and S so as to maintain Invariant |I[ Namely, for each reset entry (x, y) 

specified by AX (line 3), it looks for triples (x, y, z) such that V(x, y, z) is going 

to be no more satisfied due to the reset of [x, y] to be performed (lines 5-6 and 

lines 10-11); Prod a and S are adjusted accordingly (lines 7-8 and lines 12-13). 

The distinction between the two cases max{ LastRowx* [x], LastColx^ [y]} > LastFlipx^[ x , y ] 

and max{ LastRowx^ [x], LastColx* [y]} < Last Flip x a [x,y] i n ime 4 and in line 9, 

respectively, is important to achieve fast running times as it will be discussed in 

the proof of Theorem |2|. Here we only point out that if the test in line 4 succeeds, 

then we can scan any z s.t. X^[y, z] = 1 without affecting the running time. If 

this is not the case, then we need to process only indices z such that the test 

LastColx%[y, z] > LastFlip X a[x,y] is satisfied, and avoid scanning other indices. 

For this reason line 10 must be implemented very carefully by maintaining indices 

z in a list and by using a move-to-front strategy that brings index z to the front of 

the list as any operation Init(. . .), SetRow(^, . . .) or SetCol(,2, . . .) is performed 

on z. In this way indices are sorted according to the dates of operations on them. 

As last step, Reset resets the entries of X% as specified by AX (line 15). 

< o > 

The correctness of our implementation of operations Init, SetRow, SetCol, 
LazySet, Reset and Lookup is discussed in the following theorem. 

Theorem 1 At any time j , Lookup returns a matrix Yj that satisfies the relation 
Mj <^YjC Pj as in Definition [6|. 

Proof. We first remind that Y is the binarized version of S as follows from the 
implementation of Lookup. 
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To prove that observe that SetRow increases Prod a [i,y] (line 8), and 

possibly S (line 9), only if both X"[z,x] = 1 and X%[x, y] = 1: this implies that 
T a [i,y] = 1 and P[i,y] = 1. 

To prove that M C Y, notice that at time j after performing an opera- 
tion Opj=SetRow(z, AX, Xg) on the z-th row of Xf, V(i,x,y) is satisfied for any 
triple (i,x,y) such that = 1 and X%[x, y] = 1 thanks to the operation 

LastRowxg[i] Time (line 11). For X^ the proof is analogous. Now, all such 
triples (i,x,y) are enumerated by SetRow (lines 5-6): for each of them such that 
V a (i,x,y) was false at time j — 1, Prod a [i,y] is increased and possibly S[i,y] is 
increased as well (lines 7-9). If P[i,y] flips from to 1, then necessarily 
flips from to 1 for some x, and then, as stated above w.r.t. V a , Prod a [i,y] gets 
increased. Thus, recalling that Y is the binarized version of S, we have for any 

y- 

p Ahy] - p 3-ilhy} = i YAhy} - ViM = L 

From the definition of M in Definition |6| we have that: 

Mj[i,y] - Mj^y] = 1 & Pj[i,y] - P 3 -i[i,y] = 1. 

This proves the relation M C Y. A similar argument is valid also for SetCol, 
while LazySet does not affect S at all. 

To complete the proof we remark that Y = P just after any Init operation 
and that Reset leaves the data structure as if reset entries were never set to 
1. Indeed, Reset can be viewed as a sort of "undo" procedure that cancels the 
effects of previous SetRow, SetCol or Init operations. □ 

We now analyze the complexity of our implementation of the operations on 
polynomials. 

Theorem 2 Any Lookup, SetRow, SetCol and LazySet operation requires 0(n 2 ) 
time in the worst case. Any Init requires 0(h-n w + h-n 2 ) worst-case time, where 
to is the exponent of matrix multiplication. The cost of any Reset operation can 
be charged to previous SetRow, SetCol and Init operations. The maximum cost 
charged to each Init is 0(h ■ n 3 ). The space required is 0(h ■ n 2 ). 

Proof. It is straightforward to see from the pseudocode of the operations that 
any SetRow, SetCol and LazySet operation requires 0(n 2 ) time in the worst 
case. 

Init takes 0(h-n w + h- n 2 ) in the worst case: in more detail, each Prod a can 
be directly computed via matrix multiplication and any other initialization step 
requires no more than 0(n 2 ) worst-case time. 

To prove that the cost of any Reset operation can be charged to previous 
SetRow, SetCol and Init operations, we use a potential function 

$a = J2Prod a [x,y] 
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associated to each term T a of the polynomial. From the relation: 

Prod a [x,z] = \{y : Xf[x,y] = l A X«[y,z] = l A V a {x,y,z)}\ 

given in Invariant [T], it follows that < Prod a [x, z] < n for all x,z. Thus, 
< $ a < n 3 . 

Now, observe that SetRow increases $ a by at most n 2 units per operation, 
while Init increases $ a by at most n 3 units per operation. Note that LazySet 
does not affect $ a . We can finally address the case of Reset operations. Con- 
sider the distinction between the two cases max{LastRowx°[x], LastColx*[y]} > 
LastFlipx^[ x ,y] in line 4 and max{LastRow x* [x], LastColx* [y]} < LastFlipx^[ x ,y] 
in line 9. In the first case, we can charge the cost of processing any triple (x, y, z) 
to some previous operation on the x-th row of X% or to some previous operation 
on the y-th. column of X"; in the second case, we consider only those (x, y, z) for 
which some operation on the z-th column of X%[y, z] was performed after both 
X^[x, y] and X% [y, z] were set to 1. In both cases, any Reset operation decreases 
$ a by at most n units for each reset entry of Xg , and this can be charged to 
previous operations which increased $ a . □ 

The complex statement of the charging mechanism encompasses the dynamics 
of our data structure. In particular, we allow Reset operations to charge up to a 
0(n 3 ) cost to a single Init operation. Thus, in an arbitrary mixed sequence with 
any number of Init, Reset takes 0(n 3 ) amortized time per update. If, however, 
we allow Init operations to appear in a only every fl(n) Reset operations, the 
bound for Reset drops down to 0(n 2 ) amortized time per operation. 

As a consequence of Theorem |^, we have the following corollaries that refine 
the analysis of the running time of Reset operations. 

Corollary 1 If we perform just one Init operation in a sequence a of length 
Q(n), or more generally one Init operation every Q(n) Reset operations, then 
the amortized cost of Reset is 0{n 2 ) per operation. 

Corollary 2 If we perform just one Init operation in a sequence a of length 
Q(n 2 ), or more generally one Init operation every Q(n 2 ) Reset operations, and 
no operations SetRow and SetCol, then the amortized cost of Reset is 0(n) per 
operation. 

In the following, we show how to extend the previous techniques in order to 
deal with the general case of polynomials of degree k > 2. 
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4.1.2 Data Structure for Polynomials of Degree k > 2 

To support terms of degree k > 2 in P, we consider an equivalent representation 
P of P such that the degree of each term is 2. This allows us to maintain a data 
structure for P with the operations defined in the previous paragraph. 

Lemma 2 Consider a polynomial 

a=l a=l 

with h terms where each term T a has degree exactly k and variables X b are 
Boolean matrices. Let P be the polynomial over Boolean matrices of degree 2 
defined as 



h k 

l 6,fc-6-l 



a=l b=0 

where L% • and Rfj are polynomials over Boolean matrices of degree < 2 defined 
as 

ra _ f -v;; , • /-;:., , •/ j e [0,6-1] 

b ' j I In if 3 = "I 



B^_ x ■ Xj} +1+j if j £ [0,k — b — 1] 



1 In if 3 = "I 

Then P = P. 

Proof. To prove the claim, it suffices to check that 

k 

rp \ " T a pa 

1 a — Z^-^fe.b-l ' "bjfe-b-l 
6=0 

Unrolling the recursion for L bb _ 1 , we obtain: 

t a _ ya j a _ ya ya j a _ ya ya ya j 

H,b-1 — A l ' ^6,6-2 — A l " A 2 ' -^6,6-3 ~ ' ' ' ~ ^\ ' ^2 " ' ^6 ' 1 n 

Likewise, R bk _ b _i = I n ■ X b+1 ■ ■ ■ X k holds. Thus, by idempotence of the closed 
semiring of Boolean matrices, we finally have: 

k k 

Ej a r>a _ \ y ya ya ya ya ya ya rp 

-^6,6-1 ' n b,k-b-l — 2^1 ' ' '^b ' A b+1 ' " A fc — ^-1 " " A fe — J a- 

6=0 6=0 

□ 

Since P, L b - and R b j are all polynomials of degree < 2, they can be repre- 
sented and maintained efficiently by means of instances of Data Structure [1]. Our 
data structure for maintaining polynomials of degree > 2 is presented below: 
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T 




Xi 




^b-i 



X 



b+1 



X 



b+2 




L 



b.l 



R 



b.l 



■L 



b,b-l 



R-b.k-b-1 




Figure 2: Revealing new l's in Y while updating a term T of degree fc by means of 
a SetCol operation on variable Xf, or by means of a SetRow operation on variable 



Data Structure 2 We maintain explicitly the k 2 polynomials L£ ■ and R%j with 
instances of Data Structure VKe a/so maintain polynomial P with an instance 
Y of Data Structure ||. 

We now consider how to support SetRow, SetCol, LazySet, Reset, Init and 
Lookup in the case of arbitrary degree. We denote by SetRow£ =2 and SetCol/ c=2 
the versions of SetRow and SetCol implemented for k = 2. 

SetCol, SetRow 

procedure SetCol(z, AX, X%) 

1. begin 

2. XI <— + Jax,i 

3. for j <— 1 to b — 1 do 

4. Lgj.SetCol fc=2 (i, AL^,^) { it holds = X%_. ■ L%._ x } 

5. for j <— 1 to k — b — 1 do 

6. Rg d .SetRow fc=2 (i, A^i^fej-l) { * holds = • X 6 ° +1 +j } 

7. Y.SetCol fc=2 (i,Ai:g )6 _ 1 ,Lg t6 _ 1 ) 

8. Y.SetRow fc=2 (i, A^ k _ b _ v R^^) 

9. for j <- 1 to fc - 6 do L£ +jJ .LazySet(AX£,X£) 

10. for j <- 1 to 6 - 2 do Rg_jli j .LazySet (AX 6 a , X fe a ) 

11. end 

The main idea behind SetCol is to exploit associativity of Boolean matrix mul- 
tiplication in order to propagate changes of intermediate polynomials that are 
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always limited to a row or a column and thus can be efficiently handled by means 
of operations like SetRow fc=2 and SetCol fc=2 . 

In lines 3-4 SetCol propagates via SetColfc =2 the changes of the i-th column 
of Xg to then the changes of the i-th column of to L% 2 , and so on 

through the recursive decomposition: 



ra 

H,0 

L b,l 
ra 
L b,2 


= x a b 
= xi 
= xi 


In 

. T a — 


ya 
A 6-2 


va 
A 6-l 

• xu 


xi 
■xt 
■x a h 


ra 

H,b-1 


= x« 


r a ya 

' ^b,b-2 — ^1 ' 


va 
' " A 6-2 


■ xu 


■xt 



Likewise, in lines 5-6 it propagates via SetRowfc =2 a null matrix of changes of 
the i-th row of Xg +l to R%i, then the changes (possibly none) of the i-th row of 
11%! (due to the late effects of some previous LazySet) to R% 2 , and so on through 
the recursive decomposition: 



pa 

pa 
K b,l 
pa 
K b,2 


— In' X^ 
pa va 

— -"-6,0 " ^H+l 
pa ya 

— n b,l ' ^-b+2 


ya 

- A fe+1 
ya 

ya 

- A 6+l 


ya 
' A 6+2 

ya 
■ A 6+2 


ya 
■ A 6+3 


pa 

n b,k-b-l 


pa ya 

— n b,k-b-2 ' 


ya 
- A fe+1 


ya 
■ A 6+2 


ya 
' A 6+3 



We remark that both loops in lines 3-4 and in lines 5-6 reveal, gather and 
propagate any l's that appear in the intermediate polynomials due to the late 
effects of some previous LazySet. In particular, even if the presence of lines 5-6 
may seem strange because AX£ +1 = n , these lines are executed just for this 
reason. 

Finally, in lines 7-8 changes of ^ b _ 1 and i?^ fc _ 6 _ 1 are propagated to Y, which 
represents the maintained value of P, and in lines 9-10 new l's are lazily inserted 
in any other polynomials that feature X£ as a variable. 

We omit the pseudocode for SetRow because it is similar to SetCol. 

Reset , LazySet , Init , Lookup 

Reset(AX, X£) can be supported by propagating via Resetfc =2 any changes of X£ 
to any intermediate polynomial L^ v and R% that contains it, then changes of such 
polynomials to any polynomials which depend on them and so on up to Y. 

LazySet(AX, X% ) can be supported by performing LazySetfe =2 operations on each 
polynomial L a u v and R% v that contains Xg. 

Init(Zj L , . . . , Z%) can be supported by invoking Init fe=2 on each polynomial L™ , 
R% v and by propagating the intermediate results up to Y. 

Lookup () can be realized by returning the maintained value Y of P. 
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To conclude this section, we discuss the correctness and the complexity of our 
operations in the case of polynomials of arbitrary degree. 

Theorem 3 At any time j , Lookup returns a matrix Yj that satisfies the relation 
Mj CFjC Pj as in Definition [6|. 

Proof. Since P = P by Lemma 0, we prove that: 

P j ^Y j DM j = £ P^). 

1 < i < j ■ 
0p i ^ LazySet 

To this aim, it is sufficient to prove that any 1 that appears (or disappears) 
in the correct value of P due to an operation different from LazySet appears (or 
disappears) in Y as well, and that any entry of Y equal to 1 is also equal to 1 in 
P. 

• SetCol/SetRow: assume a SetCol operation is performed on the z'-th col- 
umn of variable (see Figure D). By induction, we assume that all new 
l's are correctly revealed in the z-th column of our data structure for L b - 
after the j-th iteration of SetColfc =2 in line 4. Notice that AL£ ■ = Jal% 
that is changes of L£ ■ are limited to the 2-th column: this implies that 
these changes can be correctly propagated by means of a SetCol opera- 
tion to any polynomial that features L£ • as a variable. As a consequence, 
by Theorem [I], the j + 1-th iteration of SetColfc =2 in line 4 correctly re- 
veals all new l's in our data structure for L b j +1 , and again these new 
l's all lie on its z-th column. Thus, at the end of the loop in lines 3-4, 
all new l's appear correctly in the z-th column of L bb _ l . Similar con- 
siderations apply also for R bk _ b _ v To prove that lines 7-8 insert cor- 
rectly in Y all new l's that appear in P and that Y C P we use again 
Theorem [I] and the fact that any 1 that appears in P also appears in 
^66-1 ' Rlk-b-v Indeed, for any entry P[x, y] that flips from to 1 due 
to a change of the z'-th column of X b or the z-th row of X b+1 there is 
a sequence of indices x = uq, Ui, . . . , «6 = i, • • • , Wfe-ij Uk = y 
such that Xj[uj-!,Uj] = 1, 1 < j < k, and either one of X*[v,b-!,i] or 
X b+l [i, Ub + i] just flipped from to 1 due to the SetRow/SetCol operation. 
The proof for SetRow is completely analogous. 

• Reset: assume a Reset operation is performed on variable Xg. AsResetfc =2 
can reset any subset of entries of variables, and not only those lying on a row 
or a column as in the case of SetRow^. =2 and SetColfc =2 , the correctness of 
propagating any changes of X b to the polynomials that depend on it easily 
follows from Theorem [I]. 

• Init: each Init operation recomputes from scratch all polynomials in Data 
Structure 0. Thus Y = P after each Init operation. 
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Theorem 4 Any Lookup and LazySet operation requires 0(n 2 ) time in the worst 
case. Any SetRow and SetCol operation requires 0(k ■ n 2 ) amortized time, and 
any Init operation takes 0(h ■ k ■ + h ■ n 2 ) worst-case time. The cost of any 
Reset operation can be charged to previous SetRow, SetCol and Init operations. 
The maximum cost charged to each Init is 0(h ■ k ■ n 3 ). The space required is 
0(h-k 2 -n 2 ). 

Proof. The proof easily follows from Theorem |2|. □ 
As in the previous paragraph, we have the following corollaries. 

Corollary 3 If we perform just one Init operation in a sequence a of length 
Q(n), or more generally one Init operation every Q(n) Reset operations, then 
the amortized cost of Reset is 0(k ■ n 2 ) per operation. 

Corollary 4 If we perform just one Init operation in a sequence a of length 
Q(n 2 ), or more generally one Init operation every Q(n 2 ) Reset operations, and 
we perform no operations SetRow and SetCol, then the amortized cost of Reset 
is 0(k ■ n) per operation. 

4.2 Maintaining Dynamic Matrices over Integers 

In this section we study the problem of finding an implicit representation for 
a matrix of integers that makes it possible to support simultaneous updates of 
multiple entries of the matrix very efficiently at the price of increasing the lookup 
time required to read a single entry. This problem on dynamic matrices will be 
central to designing the first subquadratic algorithm for fully dynamic transitive 
closure that will be described in Section [7|. We formally define the problem as 
follows: 

Definition 7 Let M be an n x n integer matrix. We consider the problem of 
performing an intermixed sequence a = (M.0p 1; . . . ,M.0p ; ) of operations on M, 
where each operation M.Op^- can be either one of the following: 

• M. Init(X): perform the initialization M <— X , where X is annxn integer 
matrix. 

• M.Update(J, /): perform the update operation M M + J ■ I , where J is 
an n x 1 column integer vector, and I is a 1 x n row integer vector. The 
product J ■ I is an n x n matrix defined for any 1 < x, y < n as: 

(J-I)[x,y] = J[x]-I[y] 
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• M.Lookup(x, y): return the integer value M[x,y]. 

It is straightforward to observe that Lookup can be supported in unit time and 
operations Init and Update in 0(n 2 ) worst-case time by explicitly performing 
the algebraic operations specified in the previous definition. 

In the following we show that, if one is willing to give up unit time for Lookup 
operations, it is possible to support Update in 0(n a '' 1 ' e ' 1 ' e ) worst-case time for 
each update operation, for any e, < e < 1, where cu(l,e, 1) is the exponent of 
the multiplication of an n x n € matrix by ann'xii matrix. Queries on individual 
entries of M are answered in 0(n € ) worst-case time via Lookup operations and 
Init still takes 0(n 2 ) worst-case time. 

We now sketch the main ideas behind the algorithm. We follow a simple lazy 
approach: we log at most n e update operations without explicitly computing 
them and we perform a global reconstruction of the matrix every n e updates. 
The reconstruction is done through fast rectangular matrix multiplication. This 
yields an implicit representation for M which requires us to run through logged 
updates in order to answer queries about entries of M. 

Data Structure 

We maintain the following elementary data structures with 0(n 2 ) space: 

• an n x n integer matrix Lazy which maintains a lazy representation of M; 

• an n x n e integer matrix Bufj in which we buffer update column vectors 
J; 

• an n e x n integer matrix Bufi in which we buffer update row vectors /; 

• a counter t of the number of performed Update operations since the last 
Init, modulo n e . 

Before proposing our implementation of the operations introduced in Defi- 
nition [7], we discuss a simple invariant property that we maintain in our data 
structure and that guarantees the correctness of the implementation of the oper- 
ations that we are going to present. We use the following notation: 

Definition 8 We denote by Bufj{j) the n x j matrix obtained by considering 
only the first j columns ofBufj. Similarly, we denote by Bufi(i) theixn matrix 
obtained by considering only the first i rows of Bufj. 

Invariant 2 At any time t in the sequence of operations a, the following invari- 
ant is maintained: 

M = Lazy + Bufj(t) ■ Bufi{t). 
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Update 

procedure Update(J, I) 

1. begin 

2. i<-i + l 

3. if t < n e then 

4. Bufj[.,t]<-J 

5. Bu/j[t,.]«-I 

6. else 

7. £ <- 

8. Lazy <— Lazy + Bufj ■ Bufj 

9. end 

Update first increases £ and, if £ < n e , it copies column vector J onto the £-th 
column of Bufj (line 4) and row vector I onto the £-th row of Bufj (line 5). If 
£ > there is no more room in Bufj and Bufj for buffering updates. Then the 
counter £ is reset in line 7 and the reconstruction operation in line 8 synchronizes 
Lazy with M via rectangular matrix multiplication of the n x n e matrix Bufj 
by the n e x n matrix Bufj. 

Lookup 

procedure Lookup(x,y) 

1. begin 

2. return Lazy[x, y] + £$=i Bufj[x,j] ■ Bufi\j, y] 

3. end 

Lookup runs through the first £ columns and rows of buffers Bufj and Bufi, 
respectively, and returns the value of Lazy corrected with the inner product of 
the x-th row of Bufj(t) by the y-th column of Bufi(t). 

Init 

procedure Init(X) 

1. begin 

2. Lazy <— X 

3. t <- 

4. end 

Init simply sets the value of Lazy and empties the buffers by resetting £. 

The following theorem discusses the time and space requirements of operations 
Update, Lookup, and Init. As already stated, the correctness easily follows from 
the fact that Invariant |2] is maintained throughout any sequence of operations. 

Theorem 5 Each Update operation can be supported in 0{n u ^ 1,e,lS} ~ e ) worst-case 
time and each Lookup in 0(n e ) worst-case time, where < e < 1 and cu(l,e, 1) 
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is the exponent for rectangular matrix multiplication. Init requires 0(n 2 ) time 
in the worst case. The space required is 0(n 2 ). 

Proof. An amortized update bound follows trivially from amortizing the cost of 
the rectangular matrix multiplication Bufj ■ Bufi against n e update operations. 
This bound can be made worst-case by standard techniques, i.e., by keeping two 
copies of the data structures: one is used for queries and the other is updated by 
performing matrix multiplication in the background. 

As fas as Lookup is concerned, it answers queries on the value of M[x,y] in 
worst- case time, where t < n e . □ 



Corollary 5 If 0(n w ) is the time required for multiplying two n x n matrices, 
then we can support Update in 0(n 2 ~( 3 ~^ e ) worst-case time and Lookup in 0(n e ) 
worst-case time. Choosing e = 1, the best known bound for matrix multiplication 
(uo < 2.38,) implies an 0(n 138 ) Update time and an 0(n) Lookup time. 

Proof. A rectangular matrix multiplication between a n x n e matrix by a n £ x 
n matrix can be performed by computing O^n 1 " 6 ) 2 ) multiplications between 
n e x n e matrices. This is done in O ((n l ~ € ) 2 ■ (n 6 )^). The amortized time of the 
reconstruction operation Lazy <— Lazy+Bufj-Bufi is thus O — ^ ^ +n ) = 
Q^ n 2-(3-u))ty rjj^ reg ^ Q £ ^ e c j a j m f u ows from Theorem || □ 



5 Transitive Closure Updates in 0(n 2 logn) Time 

In this section we show a first method for casting fully dynamic transitive closure 
into the problem of reevaluating polynomials over Boolean matrices presented in 
Section Ol. 



Based on the technique developed in Section |4.1| , we revisit the dynamic graph 



algorithm given in in terms of dynamic matrices and we present a matrix- 
based variant of it which features better initialization time while maintaining the 
same bounds on the running time of update and query operations, i.e., 0{n 2 -log n) 
time per update and 0(1) time per query. The space requirement of our algorithm 
is M{n) ■ logn, where M(n) is the space used for representing a polynomial over 
Boolean matrices. As stated in Theorem |], M{n) is 0{n 2 ) if h and k are constant. 

In the remainder of this section we first describe our data structure and then 
we show how to support efficiently operations introduced in Definition [| for the 
equivalent problem of fully dynamic Boolean matrix closure. 



5.1 Data Structure 

As it is well known, the Kleene closure of a Boolean matrix X can be computed 
from scratch via matrix multiplication by computing log 2 n polynomials Pk = 
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Pk-i + -Pfc_i, 1 < k < log 2 n. In the static case where X* has to be computed only 
once, intermediate results can be thrown away as only the final value X* = P\ og2 n 
is required. In the dynamic case, instead, intermediate results provide useful 
information for updating efficiently X* whenever X gets modified. 

In this section we consider a slightly different definition of polynomials Pi, ... , P\ , 
with the property that each of them has degree < 3: 

Definition 9 Let X be an n x n Boolean matrix. We define the sequence of 
log 2 n + 1 polynomials over Boolean matrices Po, . . . , P\og 2 n as: 

f X if k = 

k ~\ Pk-i + PL. + Pti ifk>0 

Before describing our data structure for maintaining the Kleene closure of X, 
we discuss some useful properties. 

Lemma 3 Let X be an n x n Boolean matrix and let P& be formed as in Defi- 
nition |]. Then for any 1 < u, v < n, Pk[u, v ] = 1 if and only if there is a path 
u ~» v of length at most 3 fc in X. 

Proof. The proof is by induction on k. The base (k = 0) is trivial. We assume 
by induction that the claim is satisfied for Pk-i and we prove that it is satisfied 
for Pk as well. 

Sufficient condition: Any path of length up to 3 fc between u and v in X is 
either of length up to 3 fc_1 or it can be obtained as concatenation of three paths 
of length up to 3 fc_1 in X. Since all these paths are correctly reported in Pk-i 
by the inductive hypothesis, it follows that Pk-i[u,v] = 1 or P^_ 1 [w,t>] = 1 or 
Pl^iu, v] = 1. Thus P k [u, v] = P h _ x [u, v] + P^u, v] + Pl_ x [u, v] = l. 

Necessary condition: If Pk[u,v] = 1 then at least one among Pk-i[u,v], 
Pk-i[ u , v ] an d Pfc_i[w> v] is 1. If Pk-i[u, v] = 1, then by the inductive hypothesis 
there is a path of length up to 3 fe_1 < 3 k . If P^^UjV) = 1, then there are two 
paths of length up to 3 fc_1 whose concatenation yields a path no longer than 3 fc . 
Finally, if P^Jit, v] = 1, then there are three paths of length up to 3 fc_1 whose 
concatenation yields a path no longer than 3 k . □ 



Lemma 4 Let X be an n x n Boolean matrix and let Pk be formed as in Defini- 
tion |. Then X* = I n + P l0 g 2 „. 

Proof. The proof easily follows from Lemma [3] and from the observation that that 
the length of the longest simple path in X is no longer than n— 1 < 3 log3 11 < 3 log2 n . 
I n is required to guarantee the reflexivity of X*. □ 

Our data structure for maintaining X* is the following: 
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Data Structure 3 We maintain an n x n Boolean matrix X and we maintain 
the log 2 n polynomials Pi . . . P\ og2 n °f degree 3 given in Definition ^ with instances 
of Data Structure presented in Section 

As we will see in Section |5.2| , the reason for considering the extra term P%_i 
in our data structure is that polynomials need to be maintained using not only 
SetRow/SetCol, but also LazySet. As stated in Definition ^, using LazySet 
yields a weaker representation of polynomials, and this forces us to increase the 
degree if complete information about X* has to be maintained. This aspect will 
be discussed in more depth in the proof of Theorem |. 

5.2 Implementation of Operations 

In this section we show that operations Init*, Set*, Reset* and Lookup* in- 
troduced in Definition || can all be implemented in terms of operations Init, 
LazySet, SetRow, and SetCol (described in Section [lTD on polynomials Pi . . . P\ og2 

Init* 

procedure Init*(X) 

1. begin 

2. Y <- X 

3. for k = 1 to log 2 n do 

4. P fc .Init(y) 

5. Y <— Pfc.LookupQ 

6. end 

Init* performs P^.Init operations on each P^ by propagating intermediate re- 
sults from X to Pi, then from Pi to P 2 , and so on up to P\ g 2 n- 

Lookup* 

procedure Lookup*(x, y) 

1. begin 

2. Y ^Pi og2n .Lookup() 

3. return I n + Y[x, y] 

4. end 

Lookup* returns the value of P\ og2n [x,y}. 
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Set* 



l. 
2. 
3. 
4. 
5. 
6. 
7. 
8. 



procedure Set* (i, AX) 
begin 



AY <- AX 



for k = 1 to log 2 n do 



P fc .LazySet(Ay,P fc _ 1 ) 
P fc .SetRow(i, AY, Pk-i) 
P fc .SetCol(i,Ay,P fc _i) 
Ay <-P fc .Lookup() 



end 



Set* propagates changes of Pk-i to Pk for any k — 1 to log 2 n. Notice that any 
new l's that appear in Pk-i are inserted in the object Pk via LazySet, but only 
the i-th row and the i-th row column of Pk-i are taken into account by SetRow 
and SetCol in order to determine changes of P&. As re-inserting l's already 
present in a variable is allowed by our operations on polynomials, for the sake of 
simplicity in line 7 we assign the update matrix AY with Pk and not with the 
variation of P&. 

Reset* 

procedure Reset* (AX) 

1. begin 

2. Ay <- AX 

3. for k = 1 to log 2 n do 

4. y <— Pfc.LookupQ 



Reset* performs P^. Reset operations on each Pk by propagating changes specified 
by AX to Pi, then changes of P\ to P 2 , and so on up to Pi og2 n . Notice that we use 
an auxiliary matrix Y to compute the difference between the value of Pk before 
and after the update and that the computation of Ay in line 6 always yields a 
Boolean matrix. 

5.3 Analysis 

In what follows we discuss the correctness and the complexity of our implemen- 
tation of operations Init*, Set*, Reset*, and Lookup* presented in Section |5.2| . 
We recall that X is an n x n Boolean matrix and P&, < k < log 2 n, are the 
polynomials introduced in Definition |S|. 

Theorem 6 If at any time during a sequence o of operations there is a path of 
length up to 2 h between x and y in X, then Pk[x,y] = 1. 



5. 
6. 

7. end 



P fc .Reset(Ay,P fc _ 1 ) 
Ay <- y-P fc .Lookup() 
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Proof. By induction. The base is trivial. We assume that the claim holds 
inductively for Pk-i, and we show that, after any operation, the claim holds also 
forP fc . 

• Init*: since any Init* operation rebuilds from scratch P&, the claim holds 
from Lemma |3|. 

• Set*: let us assume that a Set* operation is performed on the i-th row and 
column of X and a new path rr of length up to 2 fc , say 7T = (x, . . . , i, . . . , y), 
appears in X due to this operation. We prove that Pk[x,y] = 1 after the 
operation. 

Observe that P& .LazySet(APfc_i, Pk-i) puts in place any new l's in any 
occurrence of the variable Pk-i in data structure We remark that, 
although the maintained value of Pk in data structure P& is not updated 
by LazySet and therefore the correctness of the current operation is not 
affected, this step is very important: indeed, new l's corresponding to new 
paths of length up to 2 fc ~ 1 that appear in X will be useful in future Set* 
operations for detecting the appearance of new paths of length up to 2 fc . 

If both the portions x % and i ~> y of ir have length up to 2 fe_1 , then 7r gets 
recorded in P%_i, and therefore in Pk, thanks to one of P& . SetRow(i, APk-i, Pk-i) 
or Pfc . SetCol(i, APk-i, Pfc-i). On the other hand, if i is close to (but does 
not coincide with) one endpoint of 7T, the appearance of n may be recorded 
in P^-i, but not in P%_ 1 . This is the reason why degree 2 does not suffice 
for Pk in this dynamic setting. 

• Reset*: by inductive hypothesis, we assume that Pk-i[x,y] flips to zero 
after a Reset* operation only if no path of length up to 2 fc_1 remains in X 
between x and y. Since any P^ .Reset operation on Pk leaves it as if cleared 
l's in Pfe-i were never set to 1, Pk[x, y] flips to zero only if no path of length 
up to 2 k remains in X. 

□ 

We remark that the condition stated in Theorem |] is only sufficient because 
Pk may keep track of paths having length strictly more than 2 fc , though no longer 
than 3 fc . However, for k = log 2 n the condition is also necessary as no shortest 
path can be longer than n = 2 h . Thus, it is straightforward to see that a path of 
any length between x and y exists at any time in X if and only if Pi og2 n[x, y] = 1. 

The following theorem establishes the running time and space requirements 
of operations Init*, Set* and Reset*. 

Theorem 7 Any Init* operation can be performed in 0(n w ■ logn) worst-case 
time, where u is the exponent of matrix multiplication; any Set* takes 0(n 2 -logn) 
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amortized time. The cost of Reset* operations can be charged to previous Init* 
and Set* operations. The maximum cost charged to each Init is 0(n 3 ■ logn). 
The space required is 0(n 2 ■ logn). 

Proof. The proof follows from Theorem |]by considering the time bounds of op- 
erations on polynomials described in Section [Q[ As each maintained polynomial 
has constant degree k — 3, it follows that the space used is 0{n 2 ■ logn). □ 

Corollary 6 // we perform just one Init* operation in a sequence a of length 
Q(n), or more generally one Init operation every Q(n) Reset operations, then 
the amortized cost of Reset is 0(n 2 ■ logn) per operation. 



Corollary 7 // we perform just one Init* operation in a sequence a of length 
Q(n 2 ), or more generally one Init operation every Q(n 2 ) Reset operations, and 
we perform no operations SetRow and SetCol, then the amortized cost of Reset 
is 0(n ■ logn) per operation. 

In the traditional case where Opi =Init* and Opj ^ Init* for any i > 1, 
i.e., Init* is just performed once at the beginning of the sequence of operations, 
previous corollaries state that both Set* and Reset* are supported in 0(n 2 ■ 
logn) amortized time. In the decremental case where only Reset* operations are 
performed, the amortized time is 0{n ■ logn) per update. 

< o t> 

The algorithm that we presented in this section can be viewed as a variant 
which features very different data structures of the fully dynamic transitive clo- 



sure algorithm presented by King in [fig] 



King's algorithm is based on a data structure for a graph G = (V, E) that 
maintains a logarithmic number of edge subsets E , . . . , E\ og2n with the property 
that E = E and (x, y) G Ei if there is a path x ^ y of length up to 2* in G. 
Moreover, if y is not reachable from x in G, then (x, y) G" Ei for all < i < log 2 n. 

The maintained values of our polynomials Pq, . . . , Pi og2 n here correspond to 
the sets E , . . . ,^i og2 „. 

The algorithm by King also maintains log 2 n forests F , . . . , Fi og2 n _i such that 
Ei uses edges in Ei and includes 2n trees Outiiv) and In^v), two for each node 
v G V, such that Outi(v) contains all nodes reachable from v using at most 2 
edges in Ei, and Ini(v) contains all nodes that reach v using at most 2 edges in 
Ei. For each pair of nodes, also a table Counti is maintained, where Counti[x, y] 
is the number of nodes v such that x G Ini(v) and y G Outiiv). Now, Ei is 
maintained so as to contain edges (x,y) such that Counti_i[x, y] > 0. Trees 
Ini(v) and Outi(v) are maintained for any node v by means of deletions-only 
data structures 0] which are rebuilt from scratch after each w-centered insertion 
of edges. 
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Our data structures for polynomials over Boolean matrices Pi play the same 
role as King's forests Fj of In^ and Outi trees and of counters Counti. 

While King's data structures require 0(n 3 -log n) worst-case initialization time 
on dense graphs, the strong algebraic properties of Boolean matrices allow us to 
exploit fast matrix multiplication subroutines for initializing more efficiently our 
data structures in 0(n w ■ logn) time in the worst case, where u = 2.38. 



6 Transitive Closure Updates in 0(n 2 ) Time 

In this section we show our second and more powerful method for casting fully 
dynamic transitive closure into the problem of reevaluating polynomials over 
Boolean matrices presented in Section |4.1| . 

This method hinges upon the well-known equivalence between transitive clo- 
sure and matrix multiplication on a closed semiring and yields a new deterministic 
algorithm that improves the best known bounds for fully dynamic transitive clo- 
sure. Our algorithm supports each update operation in 0(n 2 ) amortized time and 
answers each reachability query with just one matrix lookup. The space used is 
0{n 2 ). 



6.1 Data Structure 

Let X be a Boolean matrix and let X* be its Kleene closure. Before discussing 
the dynamic case, we recall the main ideas behind the algorithm for computing 
statically X*. 

Definition 10 Let B n be the set of n x n Boolean matrices and let X e B n . 
Without loss of generality, we assume that n is a power of 2. Define a mapping 
T ' : B n — > B n by means of the following equations: 



E = (A + BD*C)* 

F = EBD* 

G = D*CE 

H = D* + D*CEBD* 



(1) 



where A, B, C, D and E, F, G, H are obtained by partitioning X and Y = F{X) 
into sub-matrices of dimension | x | as follows: 



X 



A 


B 


C 


D 



Y 



E 


F 


G 


H 



The following fact is well known [IS|: if X is an n x n Boolean matrix, then 
F(X) = X*. 

Another equivalent approach is given below: 
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Definition 11 Let B n be the set of n x n Boolean matrices, let X G B n and let 

Q : B n — > B n be the mapping defined by means of the following equations: 



E = A* + A*BHCA* 

F = A*BH 

G = HCA* 

H = (D + CA*B)* 



(2) 



where X and Y = G{X) are defined as: 

X 



A 


B 


C 


D 



Y 



E 


F 


G 


H 



It is easy to show that, for any X G B n , Q{X) = J-'(X). Both F{X) and 
Q{X) can be computed in O(n^) worst-case time [18|], where uo is the exponent 
of Boolean matrix multiplication. 

We now define another function 7i such that T~C{X) = X*, based on a new set 
of equations obtained by combining Equation (JJ) and Equation (0). Our goal is 
to define 7i is such a way that it is well-suited for efficient reevaluation in a fully 
dynamic setting. 

Lemma 5 Let B n be the set of n x n Boolean matrices, let X G B n and let 
7i : B n —>■ B n be the mapping defined by means of the following equations: 



P = D* 

Ei = (A + BP 2 C)* E 2 = E 1 BHlCE 1 

F 1 = E\BP F 2 = ExBHl 

G x = PC El G 2 = H 2 2 CE 1 

2 d d H 2 = (D + CE\B) 



H x = PCEfBP 
where X and Y = H(X) are defined as 



E = E\ + E 2 
F = F 1 + F 2 
G = G\ + G 2 
H = H 1 + H 2 



(3) 



X 



A 


B 


C 


D 



Y 



E 


F 


G 


H 



Then, for any X G B n , H{X) = X*. 

Proof. We prove that E x + E 2 , Fi + F 2 , Gi + G 2 and Hi + H 2 are sub-matrices 
of X*: 



X* 



E 1 + E 2 


Fx + F 2 


Gx + G 2 


H x + H 2 



We first observe that, by definition of Kleene closure, X = X* =^ X = X 2 . 
Thus, since E x = (A + BP 2 C)% H 2 = (D + CE 2 B)* and P = D* are all closures, 
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then we can replace E\ with Ei, iff with if 2 and P 2 with P. This implies that 
E 1 = (A + BPC)* = (A + BD*C)* and then E x = E by Equation @. Now, E is 
a sub-matrix of X* and encodes explicitly all paths in X with both end-points 
in V 1 = {1, ... , f }, and since £ 2 = + CEB)*CE, then £ 2 C £?. It follows 

that Ei + E2 = E + E2 = E. With a similar argument, we can prove that F1 + F2, 
G\ + G2 and Hi + if 2 are sub-matrices of X*. In particular, for H — Hi + H 2 we 
also need to observe that D* <Z H 2 . □ 

Note that 7i provides a method for computing the Kleene closure of an n x n 
Boolean matrix, provided that we are able to compute Kleene closures of Boolean 
matrices of size | x |. The reason of using Ef, H\ and P 2 instead of Ei, H 2 and 
P in Equation (0), which is apparently useless, will be clear in Lemma after 
presenting a fully dynamic version of the algorithm that defines TC. 

In the next lemma we show that a Divide and Conquer algorithm that recur- 
sively uses Ti to solve sub-problems of smaller size requires asymptotically the 
same time as computing the product of two Boolean matrices. 

Theorem 8 Let X be an n x n Boolean matrix and let T(n) be the time required 
to compute recursively 7i(X). Then T(n) = 0(n w ) ; where 0(n u ) is the time 
required to multiply two Boolean matrices. 

Proof. It is possible to compute E, F, G and H with three recursive calls of Tl, 
a constant number c m of multiplications, and a constant number c s of additions 
of I x I matrices. Thus: 



T(n)<3T(-) + c m M(-)+c s (-] 



where M{n) = 0{n^) is the time required to multiply two nxn Boolean matrices. 
Solving the recurrence relation, since log 2 3 < max{w, 2} = u>, we obtain that 
T(n) = 0(n w ) (see e.g., the Master Theorem in ||). □ 

The previous theorem showed that, even if TC needs to compute one more 
closure than T and Q, asymptotically the running time does not get worse. 

< o > 

In the following, we study how to reevaluate efficiently 7~t(X) = X* under 
changes of X. Our data structure for maintaining the Kleene closure X* is the 
following: 

Data Structure 4 We maintain two nxn Boolean matrices X and Y decom- 
posed in sub-matrices A, B , C , D, and E, F , G, H: 



X 



A 


B 


C 


D 



Y 



E 


F 


G 


H 
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We also maintain the following 12 polynomials over nx n Boolean matrices with 
the data structure presented in Section 



Q- 

Gi 
Hi 



A + BP 2 C 
= E\BP 
= PC El 
= PCEfBP 



E 2 
F 2 
G 2 
R = 



Ei BH 2 C Ei 
E1BH2 
■ H\CE\ 



E : 
F : 

G ■■ 
H 



Ei + E 2 
Fi + F 2 
Gi + G 2 
Hi + H 2 



D + CE\B 

and we recursively maintain 3 Kleene closures P, Ex and H 2 : 

P = D* Ei = Q* H 2 = R* 
with instances of size ^ x | of Data Structure presented in Section 



It is worth to note that Data Structure |] is recursively defined: P, E\ and H 2 
are Kleene closures of | x | matrices. Also observe that the polynomials Q, Fi, 
Gi, Hi, E 2 , F 2 , G 2 , R, E, F, G and H that we maintain have all constant degree 
< 6. In Figure |3] we show the acyclic graph of dependencies between objects 
in our data structure: there is an arc from node u to node v if the polynomial 
associated to u is a variable of the polynomial associated to v. For readability, 
we do not report nodes for the final polynomials E, F, G, H. A topological sort 
of this graph, e.g., r = (P, Q, E u R, H 2 , F u G u H u E 2 , F 2 , G 2 , E, F, G, H), 
yields a correct evaluation order for the objects in the data structure and thus 
gives a method for computing TC(X). 

We remark that our data structure has memory of all the intermediate values 
produced when computing Tl(X) from scratch and maintains such values upon 
updates of X. As it was already observed in Section [|, maintaining intermediate 
results of some static algorithm for computing X* is a fundamental idea for 
updating efficiently X* whenever X gets modified. 

Since our data structure reflects the way T~t(X) is computed, it basically repre- 
sents X* as the sum of two Boolean matrices: the first, say X^, is defined by sub- 
matrices Ei, Fx, Gi, Hi, and the second, say X%, by sub matrices E 2 ,F 2 ,G 2 ,H 2 : 



X* 



Ei 


Fi 


Gi 


Hi 



X* 



E 2 


F 2 


G 2 


H 2 



In the next section we show how to implement operations Init*, Set*, Reset* 
and Lookup* introduced in Definition |] in terms of operations Init, LazySet, 
SetRow and SetCol (see Section |4.1| ) on the polynomials of Data Structure f|. 



6.2 Implementation of Operations 

From a high-level point of view, our approach is the following. We maintain X{ 
and X 2 in tandem (see Figure |]): whenever a Set* operation is performed on X, 
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Ei 




D A 



Figure 3: Data dependencies between polynomials and closures. 

we update X* by computing how either X* or X| are affected by this change. 
Such updates are lazily performed so that neither X* x nor X\ encode complete 
information about X*, but their sum does. On the other side, Reset* operations 
update both X* and X* and leave the data structures as if any reset entry was 
never set to 1. 

We now describe in detail our implementation. To keep pseudocodes shorter 
and more readable, we assume that implicit Lookup and Lookup* operations are 
performed in order to retrieve the current value of objects so as to use them in 
subsequent steps. Furthermore, we do not deal explicitly with base recursion 
steps. 
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Init* 
0(n 2 - 38 ) 



X = 



r 



Set* 
0(n2) 



< 



Reset* 
0(n 2 ) 



m 



AX = 



AX = 



AX = 



from scratch ... 



SetRow/SetCol + 



recursion 




SetRow/SetCol + 



recursion 



Reset + 



recursion 



E , =(A+BP 2 C)* 



F j = E 2 BP 
Gj = PCE* 
Hj =PCE^BP 



E 2 


— E jBH 2 CE ^ 


F 2 








G 2 


= h|ce! V 


H 2 


= (D+CEjB)* 



Figure 4: Overview of operations Init*, Set* and Reset*. 



Init* 



procedure Init*(Z) 

1. begin 

2. X «- Z 

3. P. Init* (D) 

4. Q. Init (A,B,P,C) 

5. E x .Init*(Q) 

6. R.Ivit(D,C,Ei,B) 

7. H 2 .Init*(i?) 

8. Fi.Init(£;i,B,P) 

9. { similarly for Gi, H u E 2 , F 2 , G 2 , and then for E, F, G, H } 

10. end 



Init* sets the initial value of X (line 2) and initializes the objects in Data 
Structure |] according to the topological order r of the graph of dependencies as 
explained in the previous subsection (lines 3-9). 

Set* 

Before describing our implementation of Set*, we first define a useful shortcut for 
performing simultaneous SetRow and SetCol operations with the same i on more 
than one variable in a polynomial P: 
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procedure P . Set(i, AX%, . . . , AX q ) 

1. begin 

2. P.SetRow(i, AX 1 ,X 1 ) 

3. P.SetCol(i, AXi,Xi) 

4. : 

5. P.SetRow(i, AX q ,X q ) 

6. P.SetCol(i, AX q ,X q ) 

7. end 

Similarly, we give a shortcut^ for performing simultaneous LazySet operations 
on more than one variable in a polynomial P: 

procedure P . LazySet (AX\, . . . , AX q ) 

1. begin 

2. P . LazySet {AX\ , X\ ) 

3. : 

4. P.LazySet(AX 9 ,Xg) 

5. end 

We also define an auxiliary operation LazySet* on closures that performs LazySet 
operations for variables A, B, C and D on the polynomials Q, R, F\, G\, Hi, Ei, 
i*2, and G2 and recurses on the closure P which depend directly on them. We 
assume that, if M is a variable of a polynomial maintained in our data structure, 
AM = M curr — Moid is the difference between the current value M curr of M and the 
old value M oW of M. 

procedure LazySet* (AX) 

1. begin 

2. X ^ X + AX 

3. Q. LazySet (AA, AB, AC) 

4. R. LazySet (AB, AC, AD) 

5. { similarly for Fx, Gx, Hx, E2, F2, and G2 } 

6. P. LazySet* (AD) 

7. end 

Using the shortcuts Set and LazySet and the new operation LazySet*, we are now 

ready to define Set*. 

2 For the sake of simplicity, we use the same identifier LazySet for both the shortcut and 
the native operation on polynomials, assuming to use the shortcut in defining Set*. 
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procedure Set*(i, AX) 



1. 


begin 


2. 


X <— X + /ax,! + Jax,% 


3. 


if 1 < j i < | then 


4. 


Q.Set(i, AA, AS, AC) 


5. 


Ei.Set*(i, AQ) 


6. 


Fi . Set(z, AEi,AB) 


7. 


Gi . Seth, AC, AP X ) 


8. 


Hi.Set(i, AC, AE 1 ,AB) 


9. 


R.Set(i, AC, AE X , AB) 


10. 


H 2 .LazySet*(AP) 


11. 


G 2 .LazySet(AC, AP X ) 


12. 


F 2 .LazySet(A£i, AB) 


13. 


E 2 . LazySet(A£i, A£>, AC) 


14. 


else {f + l<j<n} 


15. 




16. 


P.Set*(i, AD) 


17. 


R.Set(i, AB, AC, AD) 


18. 


H 2 .Set*(i, AR) 


19. 


G 2 . Set(i, AH 2 , AC) 


20. 


F 2 .Set(i,AB,AH 2 ) 


21. 


E 2 . Set(i, AB, AH 2 , AC) 


22. 


Setd AB AP AC) 


23. 


Ei .LazySet*(AQ) 


24. 


Fi.LazySet(A J B, AP) 


25. 


Gi . LazySet(AP, AC) 


26. 


Hi . LazySet(AP, AP, AC) 


27. 


E.Init(Pi,P 2 ) 


28. 


F.Init(Pi,P 2 ) 


29. 


G.Init(Gi,C 2 ) 


30. 


H. Initio, H 2 ) 


31. 


end 



Set* performs an z-centered update in X and runs through the closures and the 
polynomials of Data Structure |] to propagate any changes of A, B, C, D to E, 
F, G, H. The propagation order is (Q, E x , F x , G x , H x , R, H 2 , G 2 , F 2 , E 2 , 
E, F, G, H) if 1 < % < § and (P, R, H 2 , G 2 , F 2 , E 2 , Q, E x , Pi, G x , H x ) if 
^ + 1 < i < n and is defined according to a topological sort of the graph of 
dependencies between objects in Data Structure f| shown in Figure [| 

Roughly speaking, Set* updates the objects in the data structure according 
to the value of i as follows: 

1. If 1 < i < f, fully updates Q, R, E x , Pi, G u Pi (lines 4-9) and lazily 
updates E 2 , F 2 , G 2 , H 2 (lines 10-13). See Figure || (a). 
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2. If 2 + 1 < i < n, fully updates P, Q, R, E 2 , F 2 , G 2 , H 2 (lines 16-22) and 
lazily updates Ex, Fx, Gx, Hi (lines 23-26). See Figure || (b). 

We highlight that it is not always possible to perform efficiently full updates 
of all the objects of Data Structure f|. Actually, some objects may change every- 
where, and not only in a row and column. Such unstructured changes imply that 
we can only perform lazy updates on such objects, as they cannot be efficiently 
manipulated by means of z-centered SetRow and SetCol operations. 

We now explain in detail the operations performed by Set* according to the 
two cases 1 < i < | and ^ + 1 < i < n. 

Case 1: 1 < i < §. 

In this case an i-centered update of X may affect the i-th row and the i-th column 
of A, the i-th row of B and the i-th column of C, while D is not affected at all 
by this kind of update (see Figure |]). The operations performed by Set* when 
1 < i < f are therefore the following: 

Line 2: an i-centered set operation is performed on X. 

Line 4: Q = A + BP 2 C is updated by performing SetRow and SetCol operations 
for any variables A, B and C being changed. P = D* does not change since, 
as already observed, D is not affected by the change. Notice that new l's 
may appear in Q only in the i-th row and column due to this operation. 

Line 5: Set* is recursively called to propagate the changes of Q to Ex. We 
remark that Ex may change also outside the i-th row and column due to 
this operation. Nevertheless, as we will see in Lemma || the fact that Ex 
is a closure implies that new l's appear in a very structured way. This will 
make it possible to propagate changes efficiently to any polynomial that, in 
turn, depends on E\. 

Lines 6—9: polynomials Fx, Gx, Hi and R are updated by performing SetRow 
and SetCol operations for any variables Ex, B and C being changed. We 
recall that such operations take into account only the entries of AEx lying 
in the i-th row and in the i-th column, albeit other entries may be non-zero. 
Again, Lemma ^| and Lemma |7| will show that this is sufficient. 

Lines 10—13: H 2 = R* is not updated, but new l's that appear in R are lazily 
inserted in the data structure of H 2 by calling LazySet*. Then LazySet 
operations are carried out on polynomials G 2 , F 2 , E 2 to insert in the data 
structures that maintain them any new l's that appear in C, Ex and B. 

Lines 27—30. Recompute polynomials E, F, G and H from scratch. This is 
required as Fx, Gx and H 2 may change everywhere and not only in a row 
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and a column. Differently from the case of Ei, whose change is structured 
as it is a closure, we cannot exploit any particular structure of AFi, AG X 
and AH2 for reducing ourselves to use SetRow and SetCol and we are forced 
to use Init. Note that, since E, F, G and H have all degree 1, this is not 
a bottleneck in terms of running time. 

Case 2: § + 1 < i < n. 

In this case an z'-centered update of X may affect only the z-th row and the z-th 
column of D, the z-th row of C and the z-th column of B, while A is not affected 
at all by this kind of update (see Figure 

Operations performed by Set* are completely analogous to the case 1 < i < ~ , 
except for the fact that we need to rescale the index z in line 15 and we have also 
to perform a recursive call to update P in line 16. 

Reset* 

Before describing our implementation of Reset*, we define a useful shortcut^ 
for performing simultaneous Reset operations on more than one variable in a 
polynomial P. 

procedure P . Reset(AXi, . . . , AX q ) 

1. begin 

2. P.Reset(AXi,Xi) 

3. : 

4. P.Reset(AX g ,X 9 ) 

5. end 

Using this shortcut, we are now ready to define Reset*. We assume that, if M is 
a variable of a polynomial maintained in our data structure, AM = — M curr 
is the difference between the value M id of M just before calling Reset* and the 
current value M curr of M. 

procedure Reset* (AX) 

1. begin 

2. X <- X - AX 

3. P.Reset*(AL>) 

4. Q . Reset(AA, A.B, AP, AC) 

5. Ei.Reset*(AQ) 

6. R.Reset(AZ), AC, AEi, AB) 

7. H 2 . Reset* (AR) 

8. Fi . Reset(A£i, AB, AP) 

9. { similarly for G x , H x , E 2 , F 2 , G 2 , and then for E, F, G, H } 

10. end 

3 For the sake of simplicity, we use the same identifier Reset for both the shortcut and the 
native operation on polynomials, assuming to use the shortcut in defining Reset*. 
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Reset* resets any entries of X as specified by AX and runs through the closures 
and the polynomials in the data structure to propagate any changes of A, B, C, 
D to E, F, G, H. The propagation is done according to a topological order r 
of the graph of dependencies shown in Figure |] and is the same order followed 
by Init*, which has a similar structure. Actually, we could think of Reset* as a 
function that "undoes" any previous work performed by Init* and Set* on the 
data structure, leaving it as if the reset entries of X were never set to 1. 

Lookup* 

procedure Lookup*(x,y) 

1. begin 

2. return Y[x, y] 

3. end 

Lookup* simply returns the maintained value of Y[x, y\. 
6.3 Analysis 

Now we discuss the correctness and the complexity of our implementation. Before 
providing the main claims, we give some preliminary definitions and lemmas that 
are useful for capturing algebraic properties of the changes that polynomials in 
our data structure undergo during a Set* operation. 

The next definition recalls a property of Boolean update matrices that is 
related to the operational concept of 2-centered update. 

Definition 12 We say that a Boolean update matrix AX is i — centered if AX = 
Iax,i + Jax,i, i-G., all entries lying outside the i-th row and the i-th column are 
zero. 

If the variation AX of some matrix X during an update operation is i-centered 
and X is a variable of a polynomial P that has to be efficiently reevaluated, then 
we can use P . SetRow and P . SetCol operations which are especially designed for 
doing so. But what happens if X changes by a AX that is not i-centered? Can 
we still update efficiently the polynomial P without recomputing it from scratch 
via Init? This is the case of Ei and AEi while performing a Set* update with 
1 < z < ~. In the following we show that, under certain hypotheses on X and AX 
(which are satisfied by E\ and AEi), we can still solve the problem efficiently. 

While the property of being i-centered is related to an update matrix by itself, 
the following two definitions are concerned with properties of an update matrix 
AX with respect to the matrix X to which it is applied: 

Definition 13 If X is a Boolean matrix and AX is a Boolean update matrix, 
we say that AX is i-transitive with respect to X if I ax, i — Iax,i • X and Jax,« = 
X ■ Jax,i- 
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Definition 14 If X is a Boolean matrix and AX is a Boolean update matrix, 
we say that AX is i-complete with respect to X if AX = JAx,r^Ax,t + ^'^Ax,i + 
Jaxj ■ X . 

Using the previous definitions we can show that the variation of X* due to 
an i-centered update of X is i-transitive and i-complete. 

Lemma 6 Let X be a Boolean matrix and let AX be an i-centered update matrix. 
If we denote by AX* the matrix (X + AX)* — X* , then AX* is i-transitive and 
i-complete with respect to X* . 

Proof. The following equalities prove the first condition of i-transitivity: 

lAX*,i ■ X* = I(X+AX)*~X*,i • X* = I(X+AX)*-X*-X*-X*,i = I(X+AX)*-X*,i = IaX*,i- 

The other conditions can be proved analogously. The hypothesis that AX is 
i-centered is necessary for the i-completeness. □ 

The following lemma shows under what conditions for AX and X it is possible 
to perform operations of the kind X <— X + AX on a variable X of a polynomial 
by reducing such operations to i-centered updates even if AX is not i-centered. 

Lemma 7 If X is a Boolean matrix such that X = X* and AX is an i-transitive 
and i-complete update matrix with respect to X, then X + AX = (X + I A x,i + 

JAX,i) 2 - 

Proof. Since X = X* it holds that X = X 2 and X = X + I AX ,i • Jax,i- 
The proof follows from Definition [13] and Definition and from the facts that: 
Jlx,i C Ja X>1 and AX = AX + I AX>i + J AX>i . □ 

It follows that, under the hypotheses of Lemma if we replace any oc- 
currence of X in P with X 2 and we perform both P.SetRow(i, /ax,*) X) and 
P.SetCol(i, Jax,i, X), then new l's in P correctly appear. This is the reason 
why in Data Structure § we used Ef, H%, and P 2 instead of Ex, H 2 , and P, 
respectively. 

Before stating the main theorem of this section which establishes the cor- 
rectness of operations on our data structure, we discuss a general property of 
polynomials and closures over Boolean matrices that will be useful in proving the 
theorem. 

Lemma 8 Let P and Q be polynomials or closures over Boolean matrices and 
let P and Q be relaxed functions such that P(X) C P(X) and Q(Y) C Q(Y) for 
any values of variables X and Y . Then, for any X: 

Q{P{X)) c Q(P(X)) 
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Proof. Let Y = P{X) and Y = P(X). By definition, we have: Y C Y 
and QiY) C (3(Y). By exploiting a monotonic behavior of polynomials and 
closures over Boolean matrices, we have: Y C Y =>- Q(Y) C Q(Y). Thus: 
Q(Y) C Q(Y) C QiY) QiY) C Q(Y) Q(P(X)) C Q(P(X)). □ 

Theorem 9 Lei be the function defined in Lemma [| ; let X and Y be the 
matrices maintained in Data Structure 0, and let M be a Boolean matrix whose 
value at any time j is defined as: 

Mj = ]T nix^-mx^). 

1 < i < j ■ 
0p t ^ LazySet* 

If we denote by Xj and Yj the values of X and Y after the j-th operation, respec- 
tively, then the relation Mj C Yj C TC(Xj) is satisfied. 



Proof. The proof is by induction on the size n of matrices in Data Structure 
The base is trivial. We assume that the claim holds for instances of size -| and 
we prove that it holds also for instances of size n. 



Opj=Init*: since Init* performs Init operations on each object, then Yj = 

n{Xj). 

Opj=Set*: we first prove that Yj C TC(Xj). Observe that Y is obtained 
as a result of a composition of functions that relax the correct interme- 
diate values of polynomials and closures of Boolean matrices in our data 
structure allowing them to contain less l's. Indeed, by the properties of 



Lookup described in Section |4.1| , we know that, if P is the correct value of 
a polynomial at any time, then P.Lookup() C P. Similarly, by inductive 
hypothesis, if K is a Kleene closure of an | x | Boolean matrix, then at 
any time K . Lookup* (x, y) — 1 =>■ K[x, y] = 1. The claim then follows by 
Lemma || which states that the composition of relaxed functions computes 
values containing at most the l's contained in the values computed by the 
correct functions. 

To prove that Mj C Yj, based on the definition of M, it suffices to verify that 
AH(X) C AY, where AH(X) = H{X j )-H{Xj_ l ) and AY = Yj-Y 3 _ x . In 
particular, we prove that if H[x,y) flips from to 1 due to operation Set*, 
then either flips from to 1 (due to lines 4-8 when 1 < % < |), or 

X%[x, y] flips from to 1 (due to lines 17-21 when | + 1 < i < n). 

Without loss of generality, assume that the Set* operation is performed 
with 1 < i < ^ (the proof is completely analogous if ^ + 1 < % < n). 

As shown in Figure |], sub-matrices A, B and C may undergo i-centered 
updates due to this operation and so their variation can be correctly propa- 
gated through SetRow and SetCol operations to polynomial Q (line 4) and 
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to polynomials Fx, G\ and Hx (lines 6-8). As AQ is also i-centered due to 
line 4, any variation of Q, that is assumed to be elsewhere correct from pre- 
vious operations, can be propagated to closure Ex through a recursive call 
of Set* in line 5. By the inductive hypothesis, this propagation correctly 
reveals any new l's in E\. We remark that Ex may contain less l's than E 
due to any previous LazySet operations done in line 23. 

Observe now that Ex occurs in polynomials Fx, Gx and Hi and that AEi 
is not necessarily i-centered. This would imply that we cannot propagate 
directly changes of Ex to these polynomials, as no efficient operation for 
doing so was defined in Section |4.1|. However, by Lemma 01 AEi is i- 



transitive and i-complete with respect to Ex- Since Ex = E*, by Lemma [7| 
performing both SetRow(z, I&Ei,i, Ex) and SetCol(z, Jasi.i, Ex) operations 
on data structures F 1; Gi and Hi in lines 6-8 is sufficient to correctly reveal 
new l's in Fx, Gx and Hx- 

Again, note that Fx, Gx and Hi may contain less l's than F, G and H, re- 
spectively, due to any previous LazySet operations done in lines 23-26. We 
have then proved that lines 4-8 correctly propagate any i-centered update 
of X to X*. 

To conclude the proof, we observe that Ex also occurs in polynomials E 2 , 
F 2 , G 2 , R and indirectly affects H 2 . Unfortunately, we cannot update 
H 2 efficiently as AR is neither i-centered, nor i-transitive/i-complete with 
respect to R. So in lines 9-13 we limit ourselves to update explicitly R and 
to log any changes of E\ by performing LazySet operations on polynomials 
G 2 , F 2 , and E 2 and a LazySet* operation on H 2 . This is sufficient to 
guarantee the correctness of subsequent Set* operations for | + 1 < i < n. 

0pj=Reset*: this operation runs in judicious order through the objects 
in the data structure and undoes the effects of previous Set* and Init* 
operations. Thus, any property satisfied by Y still holds after performing 
a Reset* operation. 

□ 



Corollary 8 Let X be an instance of Data Structure [| and let a = (X.0p 1 , . . . , 
X.0p fc ) be a sequence of operations on X. If Op^ ^ LazySet* for all 1 < % < j < k , 
then Mj = H(Xj). 

Proof. Since 7Y(0 n ) = 0* = n , the proof easily follows by telescoping the sum 
that defines M f Mj = H{Xj) - H{X^x) + W(^i-i) - + • • • + H{X 2 ) - 

H(Xx) + H(Xx) - H(X ) = H(Xj) - H(X ) = H(Xj). □ 

To conclude this section, we address the running time of operations and the 
space required to maintain an instance of our data structure. 
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Theorem 10 Any Init* operation can be performed in 0(n w ) worst-case time, 
where u is the exponent of matrix multiplication; any Set* takes 0(n 2 ) amor- 
tized time. The cost of Reset* operations can be charged to previous Init* and 
Set* operations. The maximum cost charged to each Init* is 0(n 3 ). The space 
required is 0{n 2 ). 

Proof. Since all the polynomials in Data Structure |] are of constant degree and 
involve a constant number of terms, the amortized cost of any SetRow, SetCol, 
LazySet, and Reset operation on them is quadratic in | (see Theorem |j). Let 
T(n) be the time complexity of any Set*, LazySet* and Reset* operation. Then: 

m/ \ m /^N cn 2 
T{n) < 3T(-) + 



2' 4 

for some suitably chosen constant c > 0. As log 2 3 < 2, this implies that T(n) = 
0(n 2 ). 

Init* recomputes recursively TC from scratch using Init operations on poly- 
nomials, which require 0(n £J ) worst-case time each. We can then prove that the 
running time of Init* is 0(n u ) exactly as in Theorem || 

To conclude the proof, observe that if K(n) is the space used to maintain all 
the objects in Data Structure 0], and M(n) is the space required to maintain a 



polynomial with the data structure of Section 4.1, then: 



n 

K(n) < 3K(-) + 12M(n). 
Since M(n) = 0(n 2 ) by Theorem g, then K(n) = 0(n 2 ). □ 

Corollary 9 // we perform just one Init* operation in a sequence a of length 
Q(n), or more generally one Init* operation every Q(n) Reset* operations, then 
the amortized cost of Reset* is 0(n 2 ) per operation. 



Corollary 10 // we perform just one Init* operation in a sequence a of length 
Q(n 2 ), or more generally one Init* operation every Q(n 2 ) Reset* operations, 
and we perform no Set* operations, then the amortized cost of Reset* is 0(n) 
per operation. 

In the traditional case where Opi =Init* and Op^ ^Init* for any % > 1, i.e., 
Init* is performed just once at the beginning of the sequence of operations, previ- 
ous corollaries state that both Set* and Reset* are supported in 0{n 2 ) amortized 
time. In the decremental case where only Reset* operations are performed, the 
amortized time is 0{n) per update. 
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7 Breaking Through the 0(n 2 ) Barrier 



In this section we present the first algorithm that supports both updates and 
queries in subquadratic time per operation, showing that it is actually possible 
to break through the 0(n 2 ) barrier on the single-operation complexity of fully 
dynamic transitive closure. This result is obtained by means of a new technique 
that consists of casting fully dynamic transitive closure into the problem of dy- 
namically maintaining matrices over integers presented in Section [4.2| . As already 
shown in Section [5] and in Section |6], dynamic matrices, thanks to their strong 
algebraic properties, play a crucial role in designing efficient algorithms for the 
fully dynamic transitive closure problem. 

The remainder of this section is organized as follows. In Section |7.1| we present 
a subquadratic algorithm for directed acyclic graphs based on dynamic matrices 
that answers queries in 0(n e ) time and performs updates in 0{n UJ ^ 1,e ' x '~ e + n l+e ) 
time, for any < e < 1, where uj(1, e, 1) is the exponent of the multiplication of 
annxn' matrix by an n € x n matrix. According to the current best bounds on 
uu(l,e,l), we obtain an 0(n°' 58 ) query time and an 0(n L58 ) update time. The 
algorithm we propose is randomized, and has one-side error. 

7.1 Counting Paths in Acyclic Directed Graphs 

In this section we study a variant of the fully dynamic transitive closure problem 
presented in Definition |I] and we devise the first algorithm that supports both 
update and query in subquadratic time per operation. In the variant that we 
consider, the graph that we maintain is constrained to be acyclic; furthermore, 
Insert and Delete operations work on single edges rather than on set of edges. 
We shall discuss later how to extend our algorithm to deal with more than one 
edge at a time. 

Definition 15 Let G = (V,E) be a directed acyclic graph and let TC(G) = 
(V, E') be its transitive closure. We consider the problem of maintaining a data 
structure G for the graph G under an intermixed sequence a = (G.0p 1 , . . . , G.0p fc ) 
of update and query operations. Each operation G.Op,, on the data structure G can 
be either one of the following: 

• G. Insert (x, y): perform the update E <— EU {(x,y)}, such that the graph 
obtained after the update is still acyclic. 

• G.Delete(x, y): perform the update E <— E — {(x,y)}, where (x,y) G E. 

• G . Query(.x, y): perform a query operation on TC(G) by returning 1 if 
(x, y) G E' and otherwise. 

In this version of the problem, we do not deal explicitly with initialization 
operations. 
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Data Structure 



In [16) King and Sagert showed that keeping a count of the number of distinct 
paths between any pair of vertices in a directed acyclic graph G allows it to 
maintain the transitive closure of G upon both insertions and deletions of edges. 
Unfortunately, these counters may be as large as 2 n : to perform 0(1) time arith- 



metic operations on counters, an 0{n) wordsize is required. As shown in the 



wordsize can be reduced to 2c lg n for any c > 5 based on the use of arithmetic 
operations performed modulo a random prime number. This yields a fully dy- 
namic randomized Monte Carlo algorithm for transitive closure with the property 
that "yes" answers on reachability queries are always correct, while "no" answers 
are wrong with probability 0{\). We recall that this algorithm performs reach- 
ability queries in 0(1) and updates in 0(n 2 ) worst-case time on directed acyclic 
graphs. 

We now present an algorithm that combines the path counting approach of 
King and Sagert with our technique of implicit matrix representation. Both 
techniques are very simple, but surprisingly their combination solves a problem 
that has been open for many years. 



Data Structure 5 We keep a count of the number of distinct paths between any 
pair of vertices in graph G by means of an instance M of the dynamic matrix 
data structure described in Section ££J. We assume that M[x,y] is the number 
of distinct paths between node x and node y in graph G. Since G is acyclic, this 
number is well-defined. 



Implementation of Operations 

We now show how to implement operations Insert, Delete and Query in terms of 
operations Update and Lookup on our data structure as described in Section W72 
We assume all arithmetic operations are performed in constant time. 



Insert 

procedure Insert(x,y) 

1. begin 

2. E^EU{(x,y)} 

3. for z = 1 to n do 

4. J[z] <— M.Lookup(z, x) 

5. I[z] <— M.Lookup(y, z) 

6. M. Update (J, I) 

7. end 



Insert first puts edge (x, y) in the graph and then, after querying matrix M, 
computes two vectors J and / such that J[z\ is the number of distinct paths 
2 -v> x in G and I[z\ is the number of distinct paths y ~> z in G (lines 3-5). 
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Finally, it updates M in line 6. The operation performed on M is M <— M + J ■ I: 
this means that the number M[u, v] of distinct paths between any two nodes 
(u, v) is increased by the number J[u] of distinct paths u ~» x times the number 
I[v] of distinct paths y ~> v , i.e., M[u, v] <— M[u, v] + J[u) ■ I[v]. 

Delete 

procedure Delete(x,y) 

1. begin 

2. E<-E-{(x,y)} 

3. for z = 1 to n do 

4. J[z] <— M.Lookup(z, x) 

5. I[z] M.Lookup(y, z) 

6. M.Update(-J,7) 

7. end 

Delete is identical to Insert, except for the fact that it removes the edge (x,y) 
from the graph and performs the update of M in line 6 with —J instead of J. 
The operation performed on M is M <— M — J ■ I: this means that the number 
M[u, v] of distinct paths between any two nodes (u, v) is decreased by the number 
J[u] of distinct paths u ~» x times the number I[v] of distinct paths y ~» v, i.e., 
M[u, v] <- M[u, v] - J[u] ■ I[v\. 

Query 

procedure Query(x, y) 

1. begin 

2. if M. Lookup(x, y) > then return 1 

3. else return 

4. end 

Query simply looks up the value of M[x, y] and returns 1 if the current number 
of distinct paths between x and y is positive, and zero otherwise. 

< o > 

We are now ready to discuss the running time of our implementation of op- 
erations Insert, Delete, and Query. 

Theorem 11 Any Insert and any Delete operation can be performed in 0(n u) ( 1 ' € ' 1 ^~ e + 
n 1+e ) worst-case time, for any < e < 1, where u>(l, e, 1) is the exponent of the 
multiplication of an n x n e matrix by an n e x n matrix. Any Query takes 0(n € ) 
in the worst case. The space required is 0(n 2 ). 

Proof. We recall that, by Theorem |5|, each entry of M can be queried in 0(n e ) 
worst-case time, and each Update operation can be performed in 0{n^ 1,e,1 '~ e ) 
worst-case time. Since I and J can be computed in 0(n 1+e ) worst-case time 
by means of n queries on M, we can support both insertions and deletions in 
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0( n w(l,e,l)-« + n l+e) worgt 

-case time, while a reachability query for any pair of 
vertices (x, y) can be answered in 0(n € ) worst-case time by simply querying the 
value of M[x, y). □ 

Corollary 11 Any Insert and any Delete operation requires 0(n 158 ) worst- 
case time, and any Query requires 0(n°' 58 ) worst-case time. 

Proof. Balancing the two terms in the update bound 0{n w< - 1,€,1 '~ e + n 1+t ) yields 
that e must satisfy the equation u(l,e,l) = 1 + 2e. The current best bounds 
on u>(l,e, 1) [§, [H| imply that e < 0.58 |2(|. Thus, the smallest update time is 
0(n 1 ' 58 ), which gives a query time of 0(n a58 ). □ 

The algorithm we presented is deterministic. However, as the numbers in- 
volved may be as large as 2 n , performing arithmetic operations in constant time 
requires wordsize 0(n). To reduce wordsize to O(logn) while maintaining the 
same subquadratic bounds (0(n 1,58 ) per update and 0(n a58 ) per query) we per- 
form all arithmetic operations modulo some random prime number as explained 
in [16]. Again, this produces a randomized Monte Carlo algorithm, where "yes" 
answers on reachability queries are always correct, while "no" answers are wrong 
with probability 0{\) for any constant c > 5. 

It is also not difficult to extend our subquadratic algorithm to deal with 
insertions/deletions of more than one edge at a time. In particular, we can 
support any insertion/deletion of up to 0(n 1 ~ T? ) edges incident to a common 
vertex in 0(n^^ 1 ' e ' 1 - ) ~ <E + n 2- '* 7-6 )) worst-case time. We emphasize that this is still 
o(n 2 ) for any 1 > 77 > e > 0. Indeed, rectangular matrix multiplication can 
be trivially implemented via matrix multiplication: this implies that u)(l,e,l) < 
2 — (2 — u)e, where uo = U)(l, 1, 1) < 2.38 is the current best exponent for matrix 
multiplication M. 



8 Conclusions 

In this paper we have presented new time and space efficient algorithms for main- 
taining the transitive closure of a directed graph under edge insertions and edge 
deletions. As a main contribution, we have introduced a general framework for 
casting fully dynamic transitive closure into the problem of dynamically reevalu- 
ating polynomials over matrices when updates of variables are performed. Such 
technique has turned out to be very flexible and powerful, leading both to re- 



visit the best known algorithm for fully dynamic transitive closure |1| from a 
completely different perspective, and to design new and faster algorithms for the 
problem. 

In particular, efficient data structures for maintaining polynomials over Boolean 
matrices allowed us to devise the fairly complex deterministic algorithm described 
in Section^, which supports updates in quadratic amortized time and queries with 
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just one matrix lookup. Our algorithm improves the best bounds for fully dy- 
namic transitive closure achieved in [H| and is the fastest algorithm with constant 
query time known in literature for this problem. 

In addition, a surprisingly simple technique for efficiently maintaining dy- 
namic matrices of integers under simultaneous updates of multiple entries, com- 
bined with a previous idea of counting paths in acyclic digraphs [fL6] , yielded the 
randomized algorithm presented in Section |7.1] : this algorithm, for the first time 
in the study of fully dynamic transitive closure, breaks through the 0(n 2 ) barrier 
on the single-operation complexity of the problem. 



Acknowledgements 

We are indebted to Garry Sagert and Mikkel Thorup for enlightening discussions, 
and to Valerie King for many useful comments and insights on this work. 



References 

[1] A.V. Aho, J.E. Hopcroft, and J.D. Ullman. The Design and Analysis of 
Computer Algorithms. Addison Wesley, 1974. 

[2] D. Coppersmith and S. Winograd. Matrix multiplication via arithmetic 
progressions. Journal of Symbolic Computation, 9:251-280, 1990. 

[3] T.H. Cormen, C.E. Leiserson, and R.L. Rivest. Introduction to Algorithms. 
The MIT Press, 1990. 

[4] C. Demetrescu. Fully Dynamic Algorithms for Path Problems on Directed 
Graphs. PhD thesis, Department of Computer and Systems Science, Uni- 
versity of Rome "La Sapienza", February 2001. 

[5] C. Demetrescu and G.F. Italiano. Fully dynamic transitive closure: Breaking 
through the 0(n 2 ) barrier. In Proc. of the 41st IEEE Annual Symposium on 
Foundations of Computer Science (FOCS'00), pages 381-389, 2000. 

[6] S. Even and Y. Shiloach. An on-line edge-deletion problem. Journal of the 
ACM, 28:1-4, 1981. 

[7] M. J. Fischer and A. R. Meyer. Boolean matrix multiplication and transitive 
closure. In Conference Record 1971 Twelfth Annual Symposium on Switch- 
ing and Automata Theory, pages 129-131, East Lansing, Michigan, 13-15 
October 1971. IEEE. 



51 



[8] M.E. Furman. Application of a method of fast multiplication of matrices in 
the problem of finding the transitive closure of a graph. Soviet Math. Dokl. , 
11(5), 1970. English translation. 

[9] M. Henzinger and V. King. Fully dynamic biconnectivity and transitive 
closure. In Proc. 36th IEEE Symposium on Foundations of Computer Science 
(FOCS'95), pages 664-672, 1995. 

[10] X. Huang and V.Y. Pan. Fast rectangular matrix multiplication and appli- 
cations. Journal of Complexity, 14(2):257-299, June 1998. 

[11] T. Ibaraki and N. Katoh. On-line computation of transitive closure for 
graphs. Information Processing Letters, 16:95-97, 1983. 

[12] G. F. Italiano. Amortized efficiency of a path retrieval data structure. The- 
oretical Computer Science, 48(2-3) :273-281, 1986. 

[13] G. F. Italiano. Finding paths and deleting edges in directed acyclic graphs. 
Information Processing Letters, 28:5-11, 1988. 

[14] S. Khanna, R. Motwani, and R. H. Wilson. On certificates and lookahead 
on dynamic graph problems. In Proc. 7th ACM-SIAM Symp. Discrete Al- 
gorithms, pages 222-231, 1996. 

[15] V. King. Fully dynamic algorithms for maintaining all-pairs shortest paths 
and transitive closure in digraphs. In Proc. 40th IEEE Symposium on Foun- 
dations of Computer Science (FOCS'99), 1999. 

[16] V. King and G. Sagert. A fully dynamic algorithm for maintaining the 
transitive closure. In Proc. 31st ACM Symposium on Theory of Computing 
(STOC'99), pages 492-498, 1999. 

[17] J. A. La Poutre and J. van Leeuwen. Maintenance of transitive closure 
and transitive reduction of graphs. In Proc. Workshop on Graph-Theoretic 
Concepts in Computer Science, pages 106-120. Lecture Notes in Computer 
Science 314, Springer- Verlag, Berlin, 1988. 

[18] I. Munro. Efficient determination of the transitive closure of a directed graph. 
Information Processing Letters, l(2):56-58, 1971. 

[19] D. M. Yellin. Speeding up dynamic transitive closure for bounded degree 
graphs. Acta Informatica, 30:369-384, 1993. 

[20] U. Zwick. All pairs shortest paths in weighted directed graphs - exact and 
almost exact algorithms. In Proc. of the 39th IEEE Annual Symposium on 
Foundations of Computer Science (FOCS'98), pages 310-319, Los Alamitos, 
CA, November 8-11 1998. 



52 



